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ABSTRACT 

We present new spatially-resolved Keck spectroscopy of early-type galaxies in three galaxy clusters at z « 
0.5. In two companion papers (van der Marel & van Dokkum 2006a,b) we construct dynamical models of the 
galaxies and compare their modeled M/L ratios and dynamical structure to local samples. Here we focus on 
the fundamental plane (FP) relation, and combine the kinematics with structural parameters determined from 
Hubble Space Telescope (HST) images. The galaxies obey clear FP relations, which are offset from the FP of 
the nearby Coma cluster due to passive evolution of the stellar populations. The z ~ 0.5 data are combined 
with published data for eleven additional clusters at 0.18 < z < 1.28, to determine the evolution of the mean 
M/Lb ratio of cluster galaxies with masses M > 10" Mq, as implied by the FP. We find d\og{M / Lg) / dz = 
-0.555 ± 0.042, stronger evolution than was previously inferred from smaller samples. The observed evolution 
depends on the luminosity-weighted mean age of the stars in the galaxies, the initial mass function (IMF), 
selection effects due to progenitor bias, and other parameters. Assuming a normal IMF but allowing for various 
other sources of uncertainty, we find = 2.01^o^7 for the luminosity-weighted mean star formation epoch. 
The main uncertainty is the slope of the IMF in the range 1 - 2Mq: we find z* = 4.0 for a top-heavy IMF 
with slope X = 0. The M /Lb ratios of the cluster galaxies are compared to those of field early-type galaxies 
at 0.32 <z < 1.14. Assuming that progenitor bias and the IMF do not depend on environment we find that 
the present-day age of stars in massive field galaxies is 4.1 %±2.0% (w 0.4 Gyr) less than that of stars in 
massive cluster galaxies. This relatively small age difference is surprising in the context of expectations from 
"standard" hierarchical galaxy formation models, and provides a constraint on the physical processes that are 
responsible for halting star formation in the progenitors of today's most massive galaxies. 

Subject headings: cosmology: observations — galaxies: evolution — galaxies: formation 



1. INTRODUCTION 

Recent studies of near-infrared selected samples have un- 
covered a large population of red, massive galaxies at z > 2 
(e.g., Labbe et al. 2003; Franx et al. 2003; van Dokkum 
et al. 2003; Daddi et al. 2004; Yan et al. 2004; Papovich 
et al. 2006). Red galaxies dominate the high-mass end of the 
mass function at 2 < z < 3 (van Dokkum et al. 2006) and 
are highly clustered (Daddi et al. 2003; Grazian et al. 2006; 
Quadri et al. 2006). Most of these objects are too faint in the 
rest-frame ultra-violet to be selected as Lyman break galaxies 
(e.g., Forster Schreiber et al. 2004; Reddy et al. 2005). The 
most straightforward interpretation of these newly-found ob- 
jects is that they are progenitors of today's massive early-type 
galaxies. 

Interestingly, a large fraction of these galaxies appear to 
have very high star formation rates, based on modeling of 
their spectral energy distributions, (stacked) X-ray emission, 
infrared emission, submm emission, and spectra (e.g., Forster 
Schreiber et al. 2004; Rubin et al. 2004; Daddi et al. 2004; 
Labbe et al. 2005; Knudsen et al. 2005; Papovich et al. 2006; 
Webb et al. 2006). A similar or smaller fraction is best-fit by 
passively evolving models with little ongoing star formation 
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(e.g., Labbe et al. 2005; Papovich et al. 2006; Kriek et al. 
2006b). These results suggest that by z ~ 2.5 we are entering 
the star formation epoch of massive early-type galaxies. 

An important question is whether the properties of early- 
type galaxies at intermediate redshift are consistent with this 
interpretation. Furthermore, detailed studies of early-type 
galaxies in different environments may identify descendants 
of particular populations of massive galaxies at z > 2. For 
example, it may be that the passively evolving galaxies at 
z ~ 2.5 are progenitors of cluster early-type galaxies whereas 
the star-forming galaxies are progenitors of early-type galax- 
ies in groups. 

One of the most sensitive tools for determining the star for- 
mation epoch of early-type galaxies is the redshift evolution 
of the fundamental plane (FP) relation (Djorgovski & Davis 
1987), as it reflects evolution in the M/L ratios of galaxies 
and has very low scatter The z > FP has been studied exten- 
sively in the past decade, both in clusters (e.g., van Dokkum 
& Franx 1996; van Dokkum et al. 1998a; Wuyts et al. 2004; 
Fritz et al. 2005; Holden et al. 2005; J0rgensen et al. 2006) 
and in the field (e.g., Treu et al. 1999, 2002, 2005a; van 
Dokkum et al. 2001; Rusin et al. 2003; van de Ven et al. 
2003; van der Wei et al. 2004, 2005; di Serego Alighieri 
et al. 2005). The results from these papers can be summa- 
rized as follows: 1) the M/L ratios of the most massive clus- 
ter and field early-type galaxies evolve slowly and regularly, 
indicating early formation of their stars; and 2) there is ev- 
idence that low mass galaxies evolve faster than high mass 
galaxies, both in the field and in clusters, suggesting that they 
have younger stellar populations. The interpretation of the 
first result is complicated by the small number of clusters that 
have been studied so far, uncertainties in the initial mass func- 
tion (IMF), and by selection effects due to progenitor bias: if 
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the morphologies of galaxies change with time the sample of 
high-redshift early-type galaxies is only a subset of the sample 
of nearby early-type galaxies, leading to biased age estimates 
(see van Dokkum & Franx 2001). The interpretation of the 
second result is complicated by the fact that large corrections 
for selection effects need to be made (see Treu et al. 2005a, 
2005b; van der Wei et al. 2005), and that galaxies are assumed 
to undergo no structural or dynamical changes with redshift. 

The goal of the present paper is to better constrain the star 
formation epoch of massive field and cluster galaxies, us- 
ing the FP. We present spatially resolved spectroscopic data 
from Keck and high-quality photometric data from the Hub- 
ble Space Telescope (HST) for a sample of early-type galax- 
ies in three clusters at z « 0.5. The FPs of the three clus- 
ters are presented and discussed, and combined with literature 
data on the nearby Coma cluster, a sample of nearby galaxies 
drawn from the Sloan Digital Sky Survey (SDSS), and eleven 
clusters at 0.18 < z < 1.28. The evolution of the M/L ratio 
derived from this large sample is used to constrain the star 
formation epoch of massive cluster galaxies. We combine 
the cluster sample with recently published samples of field 
galaxies, and determine the age difference between massive 
field and cluster galaxies using a self-consistent modeling ap- 
proach which is different from previous studies. In two com- 
panion papers (van der Marel and van Dokkum 2006a,b; here- 
after vdMvD06a,b) we utilize our spatially-resolved data to 
construct detailed dynamical models for the sample galaxies. 
The results are compared to those for local samples, providing 
an independent method of deriving M/L ratio evolution, and 
allowing us to vahdate the many assumptions that enter into 
analyses based on the FP. The dynamical models also yield 
a normalized measure of the rotation rate [akin to (v/cr)*] of 
the sample galaxies, providing a method to identify SO galax- 
ies that have been visually misclassified as elUpticals, based 
on their kinematics. 

The structure of the present paper is as follows. The anal- 
ysis of the spectroscopic and photometric data for the three 
z ~ 0.5 clusters is described in § 2 and § 3. In § 4 the FPs of the 
three clusters are presented and discussed. In § 5 results for 
the three clusters are combined with hterature data on 1 1 ad- 
ditional distant clusters and two local samples, and the M/Lb 
evolution is determined from this large sample. Detailed in- 
formation on the transformations of all the literature data to 
our system are given in an Appendix. The measured evolu- 
tion of the M/Lb ratio is interpreted in § 6. In this Section the 
imphcations for the star formation epoch of massive cluster 
galaxies are discussed, and the cluster data are compared to 
previously published data for field early-type galaxies. The 
main results are summarized and discussed in § 7. We assume 
= 0.3, r^A = 0.7, and Hq = 1\ km s"' Mpc"' where needed. 

2. SPECTROSCOPY 

2. 1 . Cluster Selection 

Galaxies were selected from the Small et al. (1997) 
[MORPHS] catalogs of the z w 0.5 galaxy clusters 3C295, 
CL0016H-16, and CL 1601H-42. These three clusters were se- 
lected based on their visibility at the time of our Keck ob- 
servations, and because they are among the most SO-deficient 
clusters in the MORPHS sample. The MORPHS sample itself 
was not selected according to strict criteria. Properties of the 
three clusters are listed in Table 1 . They span a range of a fac- 
tor of ~ 17 in X-ray luminosity. 3C 295 and CL 160lH-42have 
very high measured velocity dispersions, possibly indicating 



substructure along the line-of-sight. The dominant galaxy 
in 3C295 is a strong radio source. All MORPHS clusters 
have been observed with the HST WFPC2 camera, and bright 
galaxies in the WFPC2 fields were visually classified by the 
MORPHS team (Small et al. 1997; Dressier et al. 1997). 



TABLE 1 
Cluster Properties" 





3C295 


CL 0016+16 


CL 1601+42 


z 


0.460 


0.546 


0.538 




3.20 


5.88 


0.35 


a (km s~') 


1670 


1703 


1166 


F555W re.xp (ks) 




12.6 




F702W Texp (ks) 


12.6 




16.8 


F814W Texp (ks) 




16.8 




E fraction'^ 


0.51 


0.58 


0.33 


SO fraction^ 


0.21 


0.15 


0.12 


S fraction" 


0.28 


0.27 


0.55 



" Taken from Smail et al. (1997) 
* 0.3-3.5 keV; x lO^r^ ergs s"' 
" Morphological fractions from Dressier et al. (1997) 



2.2. Galaxy Selection and Observations 

The sample selection was largely constrained by the geom- 
etry of the Keck Low Resolution Imager and Spectrograph 
(LRIS) masks. Priority was given to visually-classified E and 
E/SO galaxies with Rjoi < 21.5 (3C295 and CL 1601+42) or 
/814 < 20.8 (CL 0016+16). For each cluster two masks were 
designed. The individual slits were tilted in order to opti- 
mally align them with the major axes of the galaxies, within 
the range allowed by geometrical restrictions. Some galax- 
ies are contained in both masks; for those objects we used 
position angles on the sky that are offset by 90 degrees, or 
the same position angle if the slit could be aligned with the 
major axis in both masks. A large, bright SO/Sb galaxy was 
included in one of the 3C295 masks, to test whether rota- 
tion can be reliably measured for an "obvious" disk galaxy. 
Remaining space in the multi-sht masks was used to observe 
candidate lensed galaxies and random fainter objects in the 
WFPC2 field. These secondary "fiUer" objects are not dis- 
cussed in this paper. 

The three clusters were observed with LRIS on 2001 18-19 
June. Conditions were variable, with part of the first night lost 
to clouds. The seeing varied from 0."7-0"9 during the run. 
For CL 00 16+ 16 and CL 1601+42 exposures were obtained 
for both masks; for 3C 295 only one mask was observed. The 
red arm of LRIS was used with the 900 lines mm"' grating 
blazed at 5500 A. The l."l wide sUts provide a spectral resolu- 
tion (as measured from sky emission lines) <Jinsa ~ 65 km s"' . 
The observed wavelength range is different for each object (as 
it depends on its position in the multi-sht mask), but is typi- 
cally 3500^800 A in the rest-frame. Total exposure times 
were 5400 s per mask for 3C 295 and CL 0016+16, and 9000 s 
per mask for CL 1601+42. The exposures were split into indi- 
vidual 1800 s exposures to facilitate cosmic -ray removal. The 
telescope was not moved in the sht-direction between suc- 
cessive exposures, as this would have moved the objects out 
of the (tilted) slitlets. Calibration exposures (dome flats, arc 
lamps) were obtained in daytime. Dome flats were taken at a 
range of zenith angles. 
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2.3. Reduction 

The reduction followed standard procedures for multi-slit 
spectroscopic data (see, e.g.. Kelson et al. 2000b; van 
Dokkum & Stanford 2003). Each slitlet in the masks was 
treated as a separate long slit spectrum. Bias was subtracted 
by fitting a low order polynomial to the overscan regions. The 
ccd was read out using two amplifiers; the bias of each am- 
phfier was fitted separately. Two bad columns were linearly 
interpolated. The dome flats were used to flat field the data. 
Before flat fielding, the dome flat exposures were divided by a 
polynomial fit in the A and spatial directions, as the illumina- 
tion of the ccd is different from that of the night sky emission. 

Bright cosmic rays were removed from each slitlet in the 
following way. A first order approximation of the sky emis- 
sion lines was subtracted by fitting a low order polynomial. 
This step reduced the intensity of the lines but did not re- 
move them completely because most slits are tilted. The sky 
line residuals and the galaxy spectra were largely removed by 
subtracting the median of the exposures. Remaining residuals 
were removed by fitting a polynomial, first in the spatial di- 
rection and then in the wavelength direction. This three-step 
procedure proved to be much more effective in removing the 
light of galaxies and night sky emission than simply subtract- 
ing the median. Cosmic rays were identified by comparing the 
counts to a noise model generated from the subtracted median 
image and the polynomial fits. Pixels deviating more than 4 cr 
were flagged as cosmic ray hits. 

Sky subtraction of tilted slits is difficult, as the standard fit- 
ting procedures require that sky lines are fairly well aligned 
with columns. Straightening the sky lines before subtract- 
ing them has the disadvantage of introducing aliasing effects 
due to the relatively large pixels of the LRIS ccd, which 
are very difficult to remove. More complex procedures have 
been developed which circumvent these problems (e.g.. Kel- 
son 2003); however, as we are working in a wavelength region 
mostly blueward of the ubiquitous OH lines and the galaxies 
are relatively bright, we found that an iterative approach pro- 
vided satisfactory results. At this point in the reduction we 
subtracted sky lines by simply fitting a low order polynomial 
in the spatial direction, masking the galaxy spectrum and any 
other objects in the slit. While not completely removing the 
lines, this step greatly reduces the aliasing effects that occur 
when resampling the lines after wavelength calibration. 

Wavelength calibration was done using arc lamp exposures, 
with all lamps on (Hg, Ne, Ar, Cd, and Zn). Line identifi- 
cation was done separately for each row, as the solution is a 
strong function of position for these tilted slits. The AA 6300.4 
O I sky line was used to apply small corrections to the zero- 
points of the wavelength solutions. The S -distortion of the 
spectra was determined by fitting the wavelength dependence 
of the position of the galaxy spectra. Two-dimensional poly- 
nomials were fitted to the measured positions of the sky lines 
and galaxy spectra, and these were used to rectify the spectra. 
In this procedure the spectra are resampled only once. Af- 
ter rectification, remaining sky line residuals were removed 
by fitting a polynomial (masking the galaxy spectra), and re- 
maining faint cosmic rays were removed with L. A. COSMIC 
(van Dokkum 2001). 

Spectra of the 27 target objects are shown in Fig. [T] They 
were created by averaging the central five rows (1"1), and 
have not been weighted, smoothed or binned. The sampling 
is 38kms~' pixel"', and dinstr ~ 65kms"'. The S/N varies 
from ^ 20 to ~ 60 per A. Note that the spectra are not flux- 



caUbrated. 

2.4. Redshifts 

Redshifts were determined using the cross-correlation pro- 
gram XCSAO in IRAF or, in one case, from emission lines. 
A nearby early-type galaxy was used as the template in the 
cross-correlation. The redshifts are listed in Table 2 along 
with the Smail et al. (1997) morphological classifications. 
Of 27 observed galaxies 25 are at the redshift of the cluster 
Of those 25, 24 have an early-type spectrum without strong 
emission lines. The exception is 3C 295-2014, which shows 
broad emission lines superposed on an early-type spectrum. 
The presence of AGN features in this object is not surprising 
as it hosts the bright radio source 3C 295, the namesake of the 
cluster 

The two galaxies with deviant redshifts are 3C 295-47 and 
CL 1601-270. 3C 295^7 is an emission-line object at z = 
0.1308. The lines are strong and narrow: the AA5007 [Olll] 
line has a rest-frame equivalent width Wx « 45 A, and is un- 
resolved at our resolution of w 65 km s"' . The galaxy resem- 
bles the Compact Narrow Emission Line Galaxies described 
by Guzman et al. (1996). CL 1601-270 is a field early-type 
galaxy ^ lO"' km s"' removed from the mean recession veloc- 
ity of CL 1601H-42. These galaxies are not included in the sub- 
sequent Figures and analysis. We note that galaxy CL 1601- 
270 is included in the modeling of vdMvD06a,b. 

2.5. Velocity Dispersions 

Internal velocity dispersions were determined from direct 
fits of the galaxy spectra to template star spectra. The fitting 
methodology is explained in detail elsewhere (see, e.g., van 
Dokkum & Franx 1996; Kelson et al. 2000b; van Dokkum 
& Stanford 2003). Briefly, the template stars were broadened 
such that their spectral resolution matches the instrumental 
resolution of the observations. Residuals of night sky lines, 
Balmer absorption lines and (in the case of 3C 295-2014) 
emission lines were masked in the fits. The wavelength re- 
gion used in the fit was 4 100 A -4650 A; changing the fit- 
ting region has a negligible effect on the measured disper- 
sions. For consistency with previous work (e.g., van Dokkum 
& Franx 1996) a KO giant star was adopted as the template 
star Varying the template star and the continuum filtering 
produces < 5 % variations in the dispersions. 

For each galaxy (except 3C 295^7) velocity dispersions 
were measured from the unweighted average spectrum of the 
five central rows, corresponding to a l."l x 1"1 square aper- 
ture. For consistency with previous work the measured dis- 
persions were corrected to the equivalent of a 3. "4 diameter 
aperture at the distance of the Coma cluster, using the em- 
pirical logarithmic correction given by J0rgensen, Franx, & 
Kjaergaard (1995b). For z ~ 0.5 the correction is approxi- 
mately 7 %. Corrected dispersions are listed in Table 2. The 
median random error is 8 %, and in every case the error is less 
than 15 %. 

2.6. Spatially Resolved Kinematics 

In vdMvD06a we present dynamical models of the galax- 
ies, derived from spatially-resolved kinematics and photome- 
try. Among other parameters, these models provide M/L ra- 
tios of the galaxies free of many of the assumptions that enter 
the FP analysis. The radial profiles of velocity and veloc- 
ity dispersion were determined in the following way. Veloc- 
ity dispersion measurements were made at 7 positions along 
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Fig. 1. — Keck spectra of all 27 targeted galaxies in the three cluster fields. The spectra are not binned or smoothed, and have a resolution 
of a « 65 km s"' . The vertical scale is arbitrary, as the spectra have not been calibrated. The spectroscopy shows that galaxies 3C 295^7 
(z = 0.1308) and CL 1601-270 (z = 0.5098) are field galaxies unrelated to the clusters. These two galaxies are not included in the subsequent 
figures and analysis. 
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Fig. 1. — (continued) 



the slit: the average of rows -7,-6,-5; the average of rows 
-4,-3,-2; row -1; the central row 0; row +1; the average 
of rows 2,3,4; and the average of rows 5,6,7. As the pixel 
size is 0."215 these positions correspond to -l."29, -0."64, 
-0."22, 0", 0"22, 0."64, and 1"29 from the center of the 
galaxy. Radial velocities were measured using XCSAO, as in 
some cases velocities can still be measured when the S/N is 
too low to measure dispersions. Dispersions were deemed un- 
reliable when the random error exceeds 20 %; velocities were 
discarded when the Tonry & Davis (1979) R-value is lower 
than 3. The resulting profiles are shown in vdMvD06a. 

3. PHOTOMETRY 

3.1. WFPC2 Images 

The three clusters were observed with WFPC2 on HST as 
part of the MORPHS cluster program. Details of the observa- 
tions and data characteristics are given in Smail et al. (1997). 
3C295 and CL1601H-42 were observed in one band (Rjoi) 
only; CL 0016H-16 was observed in two filters (Vsss and /gn). 



Total exposure times are listed in Table 1 . 

We obtained the raw data from the HST archive. For 
each cluster, the exposure time was divided over two posi- 
tions offset by w 2". The shifts are not integer numbers of 
pixels, and rather than combining the two positions we re- 
duced each pointing separately. This procedure has the advan- 
tages that no interpolation is required, and that we can assess 
the uncertainties in derived parameters from two independent 
datasets, with different sub-pixel sampling. The IRAF CRREJ 
task was used to combine individual frames at each position. 
Remaining cosmic rays and hot pixels were removed using 
L. A. Cosmic. 

Images of the 25 cluster galaxies are shown in Fig. |2] As 
expected, most of the galaxies have very regular morpholo- 
gies with no obvious spiral arms or other finestructure. There 
are three notable exceptions: 3C 295-568 is the SO/Sb galaxy 
added to test our ability to measure rotation curves; 3C 295- 
2014 (the central cluster galaxy) has structure in its core, 
very likely associated with its active nucleus; and CL 1601- 
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474 consists of two overlapping galaxies. Both components 
of CL 1601^74 show morphological disturbances, strongly 
suggesting this is a merger in progress. The object is flagged 
as "interaction/merger?" in the Smail et al. (1997) catalog. 
Interestingly the galaxy has an early-type spectrum, similar to 
the red mergers in the z = 0.83 cluster MS 1054-03 (Tran et al. 
2005a). 

3.2. Structural Parameters 
3.2.1. Fitting 

The galaxy images were fitted with two-dimensional mod- 
els, created by convolving r'/'^-law light distributions with 
point spread functions (PSFs). A separate PSF was used for 
each object, as the WFPC2 PSF depends on the position of 
the object on the chip. The PSFs were generated with the Tiny 



Tim 6.1a software (Krist 1995). The fitting procedure is de- 
scribed in van Dokkum & Franx (1996). Fit parameters in the 
minimalization are the x,y position, the effective radius r^, 
the surface brightness at the effective radius the position 
angle, the ellipticity, and the sky value. Nearby objects and 
the edges of ccds were masked in the fits. We inspected the 
residual images, created by subtracting the best fitting models 
from the data, to assess the quality of the fit. In most cases the 
model provides a good fit to the data; exceptions are the SO/Sb 
galaxy 3C 295-568 and the active nucleus 3C 295-2014. 

We empirically determined the errors in the fitted parame- 
ters by comparing the values derived from the two indepen- 
dent pointings, as the formal errors from the fit do not in- 
clude the effects of undersampling. The results are shown in 
Fig. 13 The uncertainties in r^, and Ig are highly correlated, 
and as is well known the correlation is almost parallel to the 
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relation 1^ cx r~'^ (solid line) which enters the FP (see, e.g., nitudes: 
J0rgensen, Franx, & Kjaergaard 1995a; van Dokkum & Franx 
1996; Kelson et al. 2000a). Not surprisingly the correlation 
is also well described by the expected relation for total flux 
conservation (4 cx r~^; broken line). The rms uncertainty in 
a single measurement is 0.012 in logrg, 0.018 in \ogIe, and 
0.003 in the FP combination logr^ + 0.81og/f. These uncer- 
tainties are negligible compared to the random errors in the 
velocity dispersions, and also to the systematic errors in the 
photometric calibration (see § 13.2.21 ). 



0.05 




-0.05 



-0.05 0.05 

log /-^(pos 1) - log (pos 2) 

Fig. 3. — Comparison of effective radii re and surface brightnesses 
le measured from two independent HST pointings. The errors in the 
two parameters are highly correlated. The broken line is the relation 



h oc r/, and the solid line is the relation Ig 
fundamental plane. 



which enters the 



B, = /?702 + 0.23(y555 - Rioi) + 0.74 
B^ = /8i4 + 0.30(y555-/814) + 0.89 

B, = Rjo2 + 0.05(y555 - Run) + 0.85 , 



(1) 
(2) 
(3) 



3.2.2. Calibration 

Measured effective radii were converted to arcseconds us- 
ing a pixel scale of 0."09993 pixel"' for the Wide Field cam- 
era, and 0"04555 pixels"' for the Planetary Camera. They 
were converted to kpc using our adopted cosmology. We note 
that the analysis in §|5] depends on the value of fl but not on 
the Hubble constant (see, e.g., van Dokkum et al. 1998a). 

Surface brightnesses were converted to the HST Vega sys- 
tem using the appropriate zeropoints for each of the WFPC2 
chips, as obtained from the WFPC2 Data Handbook,^ and cor- 
rected for Galactic extinction using the Schlegel, Finkbeiner, 
& Davis (1998) maps. The extinction is highest in the 
CL0016H-16 field (« 0.11 mag in the /gn band). We did not 
correct for the long vs. short anomaly, as our fields are not 
very crowded, or the position-dependent charge transfer effi- 
ciency (CTE), as the backgrounds are relatively high. 

For a meaningful comparison of galaxies observed at differ- 
ent redshifts the photometry has to be transformed to a com- 
mon rest-frame band. At z 0.5 the observed Rjo2 band is 
close to the rest-frame B band (denoted B,). Following the 
methodology of van Dokkum & Franx (1996) we derived the 
following transformations from observed to rest-frame mag- 

*See|http://www.stsci.edu/instruments/wfpc2AVfpc2_dhbAVFPC2_longdhbcovei(litMtJ5) (see S IA.l 



for 3C295, CL0016H-16, and CL 1601H-42 respectively. For 
the synthetic B band we used the BesseU (1990) BX filter. 
This filter is very similar to the Buser & Kurucz (1978) Z?2 
filter, which is used by Bruzual & Chariot (2003) for their 
stellar population synthesis calculations. 

Using templates from Coleman, Wu, & Weedman (1980) 
we find that the transformations are independent of spectral 
type to w 0.03. However, the colors that enter the trans- 
formations are obviously a strong function of spectral type. 
Measured colors are only available for CL0016H-16 (Small 
et al. 1997), as the other clusters were observed in only one 
filter For 3C295 and CL 1601H-42 we used synthetic col- 
ors derived from a redshifted Coleman et al. (1980) E/SO 
template, as the measured colors for CL0016H-16 are a good 
match to this template. We find Vsi^-Rioi = 1 .71 for 3C 295, 
and y555 -7;7,)2 = 1.81 for CL 1601 H-42. 

The uncertainties in the calibration are a combination of un- 
certainties in the WFPC2 zeropoints (w 0.03 mag), the long- 
short anomaly and CTE effects (sa 0.04 mag), the extinction 
(w 0.02 mag), and the conversion to rest-frame B magnitudes. 
The uncertainty introduced by the lack of measured colors is 
only a few percent for CL 1601 H-42, as the color term is small. 
However, the effect can be substantial for the lower redshift 
cluster 3C295: using an Sb/c template rather than an E/SO 
template gives B, magnitudes that are w 0.15 mag brighter. 
Hence we estimate that the typical uncertainty « 0. 10 mag for 
this cluster We assume that the systematic errors are uncor- 
rected, and can be added in quadrature (see, e.g., Freedman 
et al. 2001). The combined calibration errors in the surface 
brightnesses are « 0.11 mag for 3C295 and w 0.06 mag for 
CL0016H-16 and CL 1601H-42. 

4. THE FUNDAMENTAL PLANE 

The Fundamental Plane is an empirical relation between the 
effective radii r^, surface brightness at the effective radii 1^, 
and the velocity dispersions a of early-type galaxies of the 
form 

logr^ = alog(T + felog/(,-l-c (4) 

(Djorgovski & Davis 1987). The coefficients a and b depend 
on wavelength, sample selection, and fitting method (see, e.g., 
Bernardi et al. 2003a). J0rgensen, Franx, & Kjaergaard (1996) 
find fl = 1 .20 ± 0.06 and b = -0.83 ± 0.02 in the B band. We 
define log4 = /ip/-2.5, with /ip the rest-frame B band surface 
brightness at the effective radius in magarcsec"^, corrected 
for {l+z}* cosmological surface brightness dimming. We note 
that for an r'/'^-law fit the relation between /^p and {/ije, the 
average surface brightness within the effective radius, is given 
by /i, = + 1.393. 

4.1. Edge-on Projection 

The edge-on projections of the FPs in the three clusters are 
shown in Fig. ID Solid circles are E and E/SO galaxies, the 
solid triangle is an SO galaxy, the open square is a merger 
system, and the open circle is the SO/Sb galaxy that was added 
to test whether we can measure rotation curves. Small dots 
are galaxies in the nearby Coma cluster from J0rgensen et al. 
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Fig. 4. — Edge-on projection of the FP in the three clusters. Filled circles are E and E/SO galaxies, the filled triangle is the SO galaxy 3C 295- 
868, the open circle is the SO/Sb galaxy 3C 295-568, and the open square is the merger CL 1601-474. Dots are galaxies in the nearby Coma 
cluster. The solid line is a fit to Coma, and the broken lines show the best fitting offsets from the Coma FP for each cluster. 



The FP is well defined in each cluster: the elliptical galax- 
ies in the z « 0.5 clusters qualitatively follow very similar 
relations as the Coma galaxies. The FPs are offset with re- 
spect to Coma because of passive luminosity evolution of the 
stellar populations in the galaxies (see, e.g., van Dokkum & 
Franx 1996). The SO/Sb galaxy 3C 295-568 and the merger 
CL 1601-474 are outliers. The spiral galaxy falls below 
the relation defined by the other galaxies in 3C295, consis- 
tent with results for spiral galaxies in the z = 0.33 cluster 
CL 1358+62 (Kelson et al. 2000c). The merging galaxy falls 
above the CL 1601+42 relation, possibly because its velocity 
dispersion is overestimated due to the presence of light from 
its companion in the slit. Both galaxies were excluded from 
the analysis in §|5] 

4.2. Scatter and Tilt of the Fundamental Plane 

The galaxy samples in the individual clusters are too small 
to reliably determine the scatter in the FP. Hence the data for 
the three clusters are combined. For each galaxy the residual 
from the FP was calculated using 

AFp(logr,) = logr,-(1.201ogcr-0.831og/,+c)-0.461(z-0.024). 

(5) 

This procedure assumes that the intrinsic cluster-to-cluster 
scatter in the zeropoint of the FP is small compared to the 
galaxy-to-galaxy scatter within the clusters. The redshift 
term compensates for differential luminosity evolution over 
the redshift range spanned by the clusters, and the zeropoint 
c = -9.626 is that of the z = 0.024 Coma cluster (see §|5]l. 

The scatter is calculated using the biweight estimator and 
its uncertainty is determined from bootstrap resampling. For 
the full sample of E and E/SO galaxies (excluding the merger 
CL 1601-474) the scatter is 0.095 ±0.028 in logr,. This scat- 
ter is slightly higher than that measured in local clusters; e.g., 
J0rgensen et al. (1996) find an rms scatter of 0.071 in logr^. 
We use Monte Carlo simulations to determine whether the 
measured scatter can be fully explained by measurement er- 
rors (which are dominated by the errors in log cr). The er- 
rors imply an expected scatter of 0.048 ± 0.009, and we con- 
clude that the intrinsic scatter in this sample is 0.082 ± 0.028 
in logr^,. 

We investigate the cause of this scatter in Fig. |5] which 
shows the relation of the residuals from the FP with effec- 
tive radius, velocity dispersion, and mass. Masses were deter- 



mined using 



logM = 2 log 0- + log + 6.07. 



(6) 



The masses are approximate, as the constant in Eq. |6]depends 
on the details of the galaxy model, such as the brightness pro- 
file and the internal dynamical structure. There are clear sys- 
tematic trends in the z ~ 0.5 sample which are not present in 
the Coma sample. The residuals correlate with velocity dis- 
persion and with mass: a Spearman rank test gives a proba- 
bility of 99.8 % that Alogr^, and a are coiTelated, and a prob- 
ability of 96 % that Alogr^ and M are correlated. Correla- 
tions with logr^ and log/^ are not significant according to the 
Spearman rank test. When the sample is limited to galaxies 
having a > 200 km s"' the observed scatter in A log r^, reduces 
to 0.055 ±0.019; galaxies withM > 10" M© give a scatter of 
0.064 ±0.020. Both values are consistent with the scatter due 
to measurement errors alone, and we conclude that the intrin- 
sic scatter in the full sample of E/SO galaxies is caused by the 
galaxies with the lowest masses. 

Similar correlations were found by Wuyts et al. (2004) for 
a cluster at z = 0.58 and by J0rgensen et al. (2006) for two 
clusters at z = 0.83 and z = 0.89. Furthermore, recent studies 
of field early-type galaxies at 0.5 < z < 1.1 have found the 
same qualitative trends (Treu et al. 2005b; van der Wei et al. 
2005; di Serego Alighieri et al. 2005). The usual interpreta- 
tion of these trends is that they are due to a combination of 
selection effects and evolution in the "tilt" of the FP (see, e.g., 
Treu et al. 2005a). The evolution in the tilt is taken as evi- 
dence for differential evolution in the M/L ratio for galaxies 
of different mass, resulting from an underlying age-mass re- 
lation (with low mass early-type galaxies being younger than 
high mass early-type galaxies - see, e.g., Treu et al. 2005a; 
van der Wei et al. 2005). In vdMvD06b we show that these 
observed trends may be in part due to structural or kinematic 
evolution of low mass galaxies, as opposed to evolution of 
their M/L ratios. In the following we restrict the discussion 
to galaxies with masses > 10" Mq. As shown in vdMvD06b 
FP evolution is an unbiased measure of M/L evolution for 
galaxies in this mass range. 

Finally, we note that Kelson et al. (2000c) and Moran et al. 
(2005) do not find evidence for a change in the tilt of the FP 
for the clusters CLi358+62 at z = 0.33 and CL 0024+16 at 
z = 0.39 respectively, despite the use of large, high quality 
datasets. We also note that Wuyts et al. (2004) and Moran 
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Fig. 5. — Correlation of the residual from the FP with effective radius (a), velocity dispersion (b), and mass (c), for galaxies in the three 
clusters. Symbols have the same meaning as in Fig.|4] Galaxies with low masses or low velocity dispersions appear to systematically deviate 
with respect to the Coma cluster sample. 



et al. (2005) report an increased intrinsic scatter in the FP 
compared to nearby clusters even for high mass galaxies, 
which is not seen in our sample or in CL 1358+62 (Kelson 
et al. 2000c). The available evidence suggests that the FPs 
of intermediate redshift clusters show a range of properties 
when studied in detail, perhaps due to the stochastic nature 
of the growth of clusters (see, e.g., Tran et al. 2005b; Moran 
et al. 2005). Studies of individual clusters should therefore be 
interpreted with caution. 

5. EVOLUTION OF THE MEAN MIL RATIO 

5.1. Procedure 

Evolution of the zeropoint of the FP can be interpreted as a 
systematic change of the mean M/L ratio with redshift. The 
conversion of zeropoint offsets to offsets in M/L ratio as- 
sumes that early-type galaxies form a homologous family and 
that the FP is a manifestation of an underlying relation be- 
tween the M/L ratio of galaxies and other parameters (e.g., 
Faber et al. 1987). Starting from the empirical FP relation 
and assuming M cx cr^r^. and L cx /^.r^, this underlying relation 
is 

M/L oc a2+«/''r;('+'')/^ (7) 

(e.g.. Kelson et al. 2000c). The J0rgensen et al. (1996) B- 
band coefficients imply that M/L is largely a function of mass: 
M/L on M^^^rJ*"'^. Different coefficients imply different re- 
lations; e.g., using a large sample of galaxies drawn from the 
Sloan Digital Sky Survey (SDSS) Bernardi et al. (2003a) find 
that M/L correlates with effective radius with little or no de- 
pendence on a. 

Rather than determine relations between M/L and other ob- 
servables we follow the usual practice of determining the evo- 
lution in M/L directly from the fundamental plane offset with 
respect to z w 0: 

A log(M/L), = log(M/L), - log(M/L)o = (c, - CQ)/b. (8) 

This procedure assumes that the observed change in intercept 
of the FP is caused by evolution of the M/L ratio and that 
the coefficients a and b do not depend on the redshift. In 
vdMvD06b it is demonstrated that FP evolution thus defined 
yields an unbiased measure of M/L evolution for galaxies 
with velocity dispersions > 200 kms"' or masses > 10" Mq. 
In practice we use the Coma cluster of galaxies to define the 



zeropoint cq (see Appendix lAli. This choice is arbitrary, as 
adding a constant to all value of Alog(M/L), does not change 
the results of our subsequent analysis in any way. 

The coefficient for each cluster is determined by calcu- 
lating the residual from the FP for each early-type galaxy with 
M> 10"M(r,: 



c,^i = log - (a log 0- + /? log 4). 



(9) 



The cluster offset c~ is the biweight mean (Beers, Flynn, & 
Gebhardt 1990) of the distribution of c- ,. For consistency 
with J0rgensen et al. (1996) we use a = 1.20 and b = -0.83 
in Eqs. [8]and|9l As we show later the results are only very 
weakly dependent on the values of these parameters. For each 
cluster a random error and a systematic error is determined. 
The random error is the formal uncertainty in the biweight 
mean, or the expected uncertainty due to the eiTors in the in- 
dividual measurements, if the latter exceeds the former. As 
discussed in § l3.2.1l random errors in the velocity dispersions 
dominate over those in the structural parameters. The min- 
imum random eiTor in is therefore approximately 5a/ ^Jn, 
with 5a the average random uncertainty in the velocity disper- 
sion and n the number of galaxies in the sample. Systematic 
errors are described for each cluster individually below. 

5.2. Other Clusters 

Including the three clusters presented here there are now 
more than a dozen clusters in the redshift range 0.2 to 1 .3 for 
which FP measurements have been made. Furthermore, at low 
redshift the SDSS has provided important new information 
unhampered by uncertainties in peculiar motions and small 
number statistics. We combine published data with our own 
to compile a sample consisting of the Coma cluster, the SDSS 
sample, and fourteen distant clusters with homogeneous FP 
measurements. With one exception, the criteria for including 
distant clusters are 1) published data for individual galaxies, 
and 2) sufficient information to bring the data to a consistent 
system (i.e., colors, aperture corrections, etc). The exception 
is RX J1226+33 at z = 0.892: data of individual galaxies in 
that cluster are not yet published, but they were provided to 
us by I. J0rgensen. Details on the individual clusters and the 
derivation of systematic uncertainties are given in Appendix 
IaI and the offsets are listed in Table 3. The uncertainties in- 
clude neither the propagation of uncertainties in and JIa, 
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nor the propagation of uncertainties in distance due to cosmic 
variance in Hq (i.e., the expansion factor having somewhat 
different value when averaged over small scales than when 
averaged over large scales). This simplification is justified by 
the fact that these uncertainties are generally not the dominant 
uncertainty in the analysis (see vdMvD06b). 
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5.3. Observed Evolution and Scatter 

The observed evolution of the M/Lg ratio of massive clus- 
ter galaxies is shown in Fig.|6] There is a clear relation, with 
galaxies in clusters at high redshift having lower M/Lg ra- 
tios than those at low redshift, consistent with previous stud- 
ies of smaller samples (e.g., van Dokkum & Franx 1996; van 
Dokkumetal. 1998a; Kelson et al. 1997; Holden et al. 2005). 
The best-fitting linear function has a slope 

dlog{M/LB)/dz = -0.555 ±0.042, (10) 

and is indicated in Fig. |6] This evolution is stronger than 
inferred from previous studies that were based on smaller 
samples. For example, van Dokkum & Stanford (2003) find 
a slope of -0.460 ± 0.039 and Holden et al. (2005) find 
-0.426 ± 0.026. This difference is largely due to sample size: 
if we limit the sample to the six clusters from van Dokkum 
& Stanford or the seven clusters from Holden et al., we find 
slopes of -0.454 ± 0.052 and -0.463 ± 0.056 respectively, 
consistent with their results. We also note that our treatment 
of systematic errors is different from previous studies, which 
leads to slightly different fits (with different quoted uncertain- 
ties), even for the same data. 
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z 

Fig. 6. — Redshift evolution of the mean M/Lg ratio of clus- 
ter galaxies with masses M > lO" Mq in our composite sample. 
The solid line shows the best fitting linear function, with slope 
d\o^{M / Lb) / dz = -0.555. Dotted lines indicate the ±la uncertainty 
in the slope of the relation. 

The quoted uncertainty in the fit does not include all sources 
of systematic error. We first investigate the effects of the 
choice of coefficients of the FP (which have some uncertainty; 
see, e.g., Bernardi et al. 2003a; Cappellari et al. 2006). Vary- 
ing a from 1 .0 to 1 .5 produces best fitting slopes ranging from 
-0.564 to -0.542 and varying b from -1 . 1 to -0.6 gives slopes 
in the range -0.539 to -0.548. We conclude that the precise 
FP coefficients do not have a large effect on the derived red- 
shift evolution. 

Another source of uncertainty is the selection of the clus- 
ters. Some are selected in the optical (e.g., CL 1601H-42 
and CL0016H-16), others in X-rays (e.g., MS 2053-04 and 
MS 1043-03). Furthermore, the low redshift Coma cluster 
and Bernardi et al. sample may not be representative for the 
descendants of the (very massive) high redshift clusters in the 
sample. The data are insufficient to investigate the evolution 
of subsamples selected by X-ray luminosity, mass, or other 
parameters. However, we can assess the effects of removing 
datapoints, and determine whether the fit is driven by some 
individual clusters. Removing the Bernardi sample changes 
the best fitting slope by +0.009 to -0.546. Removing Coma 
changes the slope by -0.014. Removing both Coma and the 
Bernardi sample changes the slope by -0.005. The latter test 
effectively transfers the low redshift comparison point to the 
very rich z ~ 0.2 clusters Abell 665, Abell 2218, and Abell 
2390, which may be appropriate for the descendants of mas- 
sive clusters like MS 1054-03 at z = 0.83. All these changes 
are very small, and well within the la formal uncertainty in 
the fit. The cluster with the largest effective weight in the fit 
is RDCS 0848+44 at z = 1 .276: removing this cluster changes 
the slope by -0.052 to -0.607. The large influence of this sin- 
gle cluster highlights the importances of obtaining more FP 
measurements for cluster galaxies at z > 1 . 

The sample is sufficiently large to determine the scatter in 
the best-fitting linear relation. The of the fit is 20.4 with 
14 degrees of freedom, which implies that a linear function 
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is an adequate description of the data and that the scatter can 
be explained by the measurement uncertainties. The maxi- 
mum allowed intrinsic cluster-to-cluster scatter can be deter- 
mined by requiring that > 7.8, which for 14 degrees of 
freedom corresponds to a one-sided probability > 0.1. Itera- 
tively adding uncertainty due to intrinsic scatter in quadrature 
to the errors listed in Table 3 we find that < 7.8 for an 
additional uncertainty of > 0.057. The 90 % confidence up- 
per limit on the intrinsic cluster-to-cluster scatter is therefore 
0.057 in logCM/Lfl). 



6. IMPLICATIONS 
6.1. The Star Formation Epoch of Massive Cluster Galaxies 

The observed evolution of the M /Lb ratio of early-type 
galaxies depends on cosmological parameters, the star for- 
mation histories of the galaxies, their initial mass function 
(IMF) and metallicity, possible changes in dust content with 
redshift, and selection effects. Although the dependence on 
cosmological parameters can, in principle, be exploited (see, 
e.g., Pahre, Djorgovski, & de Carvalho 1996; van Dokkum 
etal. 1998a; J0rgensen et al. 1999; Lubin & Sandage 2001), 
we assume here that the evolution of galaxies and their stel- 
lar populations is currently more uncertain than the values of 
Vl„, and JIa (which are the relevant parameters). We also as- 
sume that dust is either absent in early-type cluster galaxies 
or non-evolving over the redshift range < z < 1.3. The star 
formation history is parameterized with a single age for the 
entire stellar population. This is almost certainly incorrect as 
most early-type galaxies likely have extended and complex 
star formation histories at early times (e.g., Nagamine et al. 
2005; Knudsen et al. 2005; Papovich et al. 2006). However, 
as shown by, e.g., van Dokkum et al. (1998b) the evolution 
of a complex stellar population can be well approximated by 
that of a single-age stellar population of the same luminosity- 
weighted age, provided that star formation has ceased > 1 Gyr 
before the epoch of observation. 

Figure |7] shows the predicted age-dependence of the M/Lg 
ratio for different assumptions about the IMF, the metallicity, 
and the implementation of horizontal branch stars in the popu- 
lation synthesis code. Solid lines are various Bruzual & Char- 
lot (2003) models, using the Salpeter (1955) and Chabrier 
(2003) IMFs and [Fe/H] ranging from Solai- to +0.56. Broken 
lines are various models of Maraston (2005), whose imple- 
mentation of the late stages of stellar evolution differs from 
that of Bruzual & Chariot (2003). The Maraston models that 
are shown are for Salpeter (1955) and Kroupa (2001) IMFs, 
metallicities ranging from Solar to +0.67, and four different 
implementations of the horizontal branch. All models were 
normalized in order to emphasize the predicted rate of evo- 
lution rather than the absolute M/L ratios at any given time 
(which vary greatly between the models). 

The predictions are very similar, demonstrating that the pre- 
dicted evolution is not very dependent on the metallicity, IMF, 
or particulars of the stellar population synthesis code. An im- 
portant caveat is that the Salpeter (1955), Kroupa (2001), and 
Chabrier (2003) IMFs, while different, were all derived from 
Galactic data, and the IMF of the (progenitors of) early-type 
galaxies may have been weighted more toward very massive 
stars (see, e.g., Larson 1998). We will return to this issue in 
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Fig. 7. — Evolution of the M/Lb ratio of a single age stellar pop- 
ulation, with arbitrary scaling. Solid lines are Bruzual & Chariot 
(2003) models with different metallicities and IMFs. Broken lines 
are Maraston (2005) models with different metallicities, IMFs, and 
implementation of horizontal branch stars. The predicted evolution 
is very similar in all models. The red line is the best-fitting powerlaw 
to a Solai- metallicity, Salpeter (1955) IMF Bruzual & Chariot (2003) 
model. 

As first shown by Tinsley (1980) the predicted evolution of 
the M/L ratio of a single-age stellar population can be approx- 
imated by a powerlaw of the form 

M/Locit-t^)'', (11) 
with the formation time of the stars. It can be deduced 
from the data shown in Fig. |7] that for any individual model 
this approximation is accurate to a few percent over the age 
range 9 < logf < 10. Fitting powerlaws to all these models 
gives values of k in the range 0.93-1.01. A Solar metallicity 
Bruzual & Chariot (2003) model with a Salpeter (1955) IMF 
gives a value of 0.97, and in the following we take k = 0.97 ± 
0.04. We note that the synthetic B filter adopted by Bruzual & 
Chariot (the Buser & Kurucz [1978] B2 filter) is very similar 
to the Bessell (1990) BX filter which we use. 

As emphasized by Franx (1993) and van Dokkum & Franx 
(2001) the observed evolution of the M /Lb ratio may under- 
estimate the true evolution because of selection effects. If 
galaxies undergo morphological evolution and transform from 
late-type galaxies into early-type galaxies at moderate red- 
shift (see, e.g.. Dressier et al. 1997) the youngest progen- 
itors of today's early-type galaxies drop out of the sample 
at high redshift. This "progenitor bias" leads to biased age 
estimates, as we trace only the oldest galaxies and not the 
full population of all progenitor galaxies. The significance 
of the effect can be estimated from the observed evolution 
of the early-type galaxy fraction in clusters and from the ob- 
served scatter in the color-magnitude relation and FP (see van 
Dokkum & Franx 2001). The maximum effect on the evolu- 
tion of the M/Lb ratio occurs if late-type galaxies are contin- 
uously transformed into early-type galaxies and the scatter in 
the color-magnitude relation is entirely due to age variations. 
In that extreme case the observed luminosity evolution under- 
estimates the true evolution by Apbiaslog(M/LB) ~ O.lz. The 
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true effect is probably smaller than the maximum value, espe- 
cially given recent evidence that the evolution of the early- 
type galaxy fraction out to z ~ 1 is weakest for the most 
massive objects (Holden et al. 2006). We conservatively as- 
sume Apbiaslog(M/LB) = (0.05 ± 0.05)z, which encompasses 
the full range of possibilities. 

With these assumptions and approximations the only free 
parameter is the time of formation of the stars f, (or the cor- 
responding redshift z*)- We determined the most likely value 
and its associated uncertainty using Monte Carlo simulations. 
In each simulation, each data point listed in Table 3 was per- 
turbed by a value drawn from a Gaussian distribution with a 
dispersion equal to the uncertainty. Next, progenitor bias was 
taken into account by decreasing all the measured logiM/Ls) 
ratios by p x z, with p drawn from a top-hat probability distri- 
bution bounded by zero and 0.1. Finally, Eq.[TT]was fit to the 
perturbed data to obtain z,, with the value of k drawn from 
a top-hat distribution bounded by 0.93 and 1.01. From 1000 
simulations we find that the mean luminosity weighted forma- 
tion redshift of the stars in massive cluster early-type galaxies 
z* = 2.01;!;o jy, where the uncertainties indicate the 68 % con- 
fidence interval. The best fitting model is shown in Fig. [8] Ig- 
noring progenitor bias (i.e., setting p = Q) gives z* = 2.23;!;q jg. 




time 

Fig. 8. — Evolution of the mean M/Lb ratio of massive cluster 
galaxies with time. Open symbols are the same datapoints as shown 
in Fig. |6] Solid symbols with errorbars are offset by -0.05 x z to 
account for progenitor bias (see text). The solid line shows the best 
fitting model for a Salpeter-like IMF, which has a formation redshift 
of the stars z, = 2.01. The broken line shows a model with a top- 
heavy IMF (slope X = 0) and a formation redshift z* = 4.0 (see §|7J. 



6.2. Comparison to Field Galaxies 
6.2.1. Data 

Several recent studies of the fundamental plane of field 
early-type galaxies have found that the most massive field 
galaxies out to z ^ 1 have similar M/L ratios as the most 
massive cluster galaxies (e.g., Treu et al. 2005a; van der 
Wei et al. 2005). However, di Serego Alighieri, Lanzoni, & 



J0rgensen (2006) find that massive cluster galaxies are much 
older than massive field galaxies, based on a comparison of 
data from Treu et al. (2005a) (for field galaxies) and J0r- 
gensen et al. (2006) (for cluster galaxies) at z 0.9. Here we 
quantify the difference between field and cluster early-types 
with M > 10" Mq using our sample of fourteen distant clus- 
ters and self-consistent modeling of the field and cluster data. 

The field samples of van Dokkum et al. (2001), van 
Dokkum & Ellis (2003), Treu et al. (2005b), and van der 
Wei et al. (2005) are used. Details are given in Appendix IB] 
Masses and offsets in M/ Lb ratio were calculated in the same 
way as for the cluster samples (see § 15.11 ). The sample com- 
prises 87 galaxies with M > 10" Mq, ranging in redshift from 
0.32 to 1.14. The sample was divided in bins of 10 galaxies 
(with the lowest and highest redshift bins containing 8 and 
9 galaxies respectively), and the central redshift and M /Lb 
ratio of each bin were determined using the biweight estima- 
tor The datapoints are listed in Table 3 and shown in Fig.|9l 
along with the cluster data (not corrected for progenitor bias). 
The low redshift point is determined from the Bernardi et al. 
(2003a) sample (see AppendixlAb. The M/L ratios of massive 
field and cluster galaxies are very similar 

6.2.2. Modeling Approach 

The M/L evolution of field and cluster galaxies can be de- 
scribed by 



{M/L)A^,=Ac{t-tcT' 
{M/L)f,M=Af(t-tfT' 



(12) 
(13) 



The standard approach for determining the star formation 
epoch of field galaxies is the same as for cluster galaxies: 
the rate of evolution of the M/L ratio is used to determine 
the luminosity-weighted star formation epoch tj, and Ay is a 
free parameter in the fit (see, e.g., Treu et al. 2002, 2005a, 
2005b; van der Wei et al. 2005; di Serego Alighieri et al. 
2005). However, this approach is not self-consistent when 
the evolution of field galaxies is compared to that of cluster 
galaxies. If field galaxies are younger than cluster galaxies, 
their mean M/L ratio evolves faster and is offset from that of 
cluster galaxies. A self-consistent model which describes the 
evolution of field and cluster galaxies has Af = Ac, and has 
three rather than four free parameters (ff, Ac, and f/). 

The contrast between self-consistent models and standard 
models is illustrated in Fig.[TO] The standard approach (bro- 
ken blue lines) allows models which have different formation 
redshifts for field and cluster galaxies to go through the same 
point at some redshift, in practice z ^ (see, e.g., Treu et al. 
2005b). The implication is that the difference in M/L ratio 
between field and cluster galaxies changes sign at z 0. This 
is not impossible, as it may be that field galaxies have a dif- 
ferent dust content, IMF, metallicity, or progenitor bias than 
cluster galaxies. However, this would mean that we live at a 
very special time, namely the only time in the past or future of 
the universe when one or more of these effects exactly cancel 
the difference in M /Lb ratio resulting from the age difference 
between field and cluster galaxies. It would also imply that 
Kf^Kc, as differences in A would in most models be accom- 
panied by differences in the rate of evolution. The approach 
followed in this paper is illustrated by the solid blue lines in 
Fig-Hni differences in age result in a different rate of evolu- 
tion and an offset in M/L ratio, such that the models converge 
at f = oo rather than intersect at the present age of the Uni- 
verse. 
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Fig. 9. — Evolution of the mean M/Lb ratio of field galaxies with 
M > lO" Mq from the literature (blue open symbols) compared to 
that of cluster galaxies with M > lO" Mq (red solid symbols). Thin 
horizontal lines show the range of redshifts covered in each bin. No 
corrections for progenitor bias were applied. The red line shows 
the best fitting model to the cluster galaxies. The broken blue line 
shows a fit to the field galaxies only, and the solid blue line shows a 
self-consistent fit to the field galaxies given a model for the cluster 
galaxies (see § 16. 2b . The age difference between massive field and 
cluster galaxies is small at ^ 4 %. 

z 




1.5 1 0.5 

time 



Fig. 10. — Illustration of different ways to model the evolution of 
the M/Lb ratio of field galaxies. The red line shows the evolution 
of a stellar population formed at z* = 2.23, appropriate for cluster 
galaxies. Blue lines show the evolution of field galaxies with z, = 1 .5 
(top) and z* = 1 (bottom) respectively. Broken blue lines illustrate the 
standard fitting approach, which allows models to intersect at z = 0. 
Solid blue lines show the self-consistent modeling approach adopted 
in the present study. 



6.2.3. Results 

We determine the star formation epoch of field galaxies tf 
given a model for cluster galaxies (f^ and Ac=A /). For sim- 
plicity we ignore progenitor bias: the relative age of cluster 
and field galaxies is insensitive to this bias as long as it is sim- 
ilar for both populations. Taking Zc = 2.23 for cluster galaxies 
and fixing the zeropoint of the model we find Zy = 1 .95^5) os foi" 
field galaxies, or an age difference of 4 % at the present day. 
Note that the quoted error does not include the uncertainty in 
the fit to cluster galaxies. Changing the amount of progenitor 
bias leads to negligible changes in this result, as long as the 
bias is the same for field and cluster galaxies. 

The significance of this difference is assessed using Monte 
Carlo simulations, which take the uncertainty in the fit to the 
cluster galaxies into account. In each simulation the M /Lg 
ratios of cluster and field galaxies were perturbed by values 
drawn from Gaussian distributions with dispersions equal to 
the uncertainties. Next the formation redshift of cluster galax- 
ies was determined, with the zeropoint of the model a free 
parameter in the fit. The formation redshift of field galaxies 
was also determined, with the zeropoint of the model fixed to 
the value that was derived from the cluster galaxies. Finally, 
the age difference was calculated from the formation red- 
shifts. From 1000 simulations we find that the age difference 
between massive field and cluster galaxies is 4.1 %±2.0%. 
Cluster galaxies are older than field galaxies in 98 % of the 
simulations. 

For completeness, we also determined the star formation 
epoch of field galaxies using the standard approach, i.e., with 
A/ as a free parameter. As expected from the curves in Fig. 
[TO]the best-fit formation redshift is lower: zj = 1.85;!;jj jj. This 
model is shown by the broken blue line in Fig.|9] 

7. DISCUSSION AND CONCLUSIONS 

In this paper we presented new measurements of kinemat- 
ics and structural parameters in three distant galaxy clusters. 
In vdMvD06a,b the spatially-resolved photometric and kine- 
matic profiles of the galaxies are used to measure the evolu- 
tion of the M /L ratio independent of many assumptions that 
enter the FP analysis. In the present study we focused on the 
FP, and the implications for the star formation epoch of the 
most massive galaxies in clusters and in the field. 

We find that the stars in massive early-type galaxies in 
clusters have a mean luminosity weighted formation redshift 
z* = 2.01;!{5'^7. The implied ages are somewhat younger than 
most previous studies, which were based on smaller samples 
(e.g., Holden et al. 2005). The quoted eiTor reflects a combi- 
nation of uncertainties in the data points, the stellar population 
synthesis models, and the significance of progenitor bias. In 
particular, varying the amount of progenitor bias from zero to 
the maximum allowed by the van Dokkum & Franx (2001) 
models [~ O.lz in Alog(M/LB)] leads to a range in z* of 
2.23- 1.84. A special form of progenitor bias is "dry" merg- 
ing, i.e., (nearly) dissipationless mergers of early-type galax- 
ies (e.g., Tran et al. 2005b; van Dokkum 2005; Bell et al. 
2006; De Lucia et al. 2006). Numerical simulations suggest 
that these mergers preserve the edge-on projection of the FP 
relation (e.g., Gonzalez-Garcia & van Albada 2003; Boylan- 
Kolchin, Ma, & Quataert 2006; Robertson et al. 2006), which 
would imply that our analysis is insensitive to this effect. 

The main source of uncertainty that our analysis does not 
(fully) take into account is the form of the IMF. The rate 
of evolution, as parameterized by k in our analysis, is very 
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dependent on the logarithmic slope of the IMF in the range 
1 -2Mq. Tinsley (1980) finds Ak - -O.SAjc, where x is the 
slope of the IMF, and the more recent Worthey (1994) mod- 
els give Ak ~ -0.22Ajic. Although we explored the values 
of K for most commonly used IMFs (Salpeter 1955; Kroupa 
2001; Chabrier 2003) these are all very similar in the region 
around one Solar mass. These IMFs are probably appropriate 
for star formation in present-day disk galaxies, but may not 
be applicable to (the progenitors of) early-type galaxies. As 
is well known the large mass of heavy elements observed in 
the hot gas of massive clusters, the abundance ratios of early- 
type galaxies, numerical simulations of star formation at very 
high redshift, and other lines of evidence suggest that the IMF 
may have been top-heavy at early times (see, e.g., Worthey, 
Faber, & Gonzalez 1992; Larson 1998; Schneider et al. 2002, 
and references therein). The top-heavy IMF preferred by Na- 
gashima et al. (2005) has x = (where x= 1 .35 is the Salpeter 
[1955] slope), which implies = 1.4. This value of k gives a 
best fitting formation redshift of z* = 4.0, higher than the z^2 
range derived for Salpeter-like IMFs. This model is shown by 
the broken line in Fig. [8] We note that our observations probe 
a very different range of stellar masses than the abundance 
studies (1 -2M0 versus ~ IOM0), and that a top-heavy IMF 
does not necessarily imply an overall change in the slope but 
could instead be caused by a time-varying characteristic mass 
(Larson 1998). 

Interestingly, there is some indirect evidence for a top- 
heavy IMF from the combination of our data with recent re- 
sults for galaxies at higher redshifts. Although many massive 
galaxies at z 2.5 are still in the process of forming stars (see 
§ 1), there is also a population with very low specific star for- 
mation rates and strong Balmer/4000 A breaks (e.g., Kriek et 
al. 2006a,b). These objects formed most of their stars at red- 
shifts z ^ 3 or beyond, inconsistent with the star formation 
epoch that we derive for Salpeter-like IMFs but in agreement 
with our results for a top-heavy IMF. Alternative explanations 
are also possible; e.g., the apparently old z ^ 2.5 galaxies may 
be "rejuvenated" at later times (see, e.g., Treu et al. 2005b), or 
they could simply be the oldest objects in a wide distribution 
of progenitor galaxies. 

The age difference between massive field and cluster galax- 
ies is better constrained than their absolute ages, as it is less 
affected by the uncertainty in the IMF. Our self-consistent 
modeling approach gives different constraints than the stan- 
dard approach, as it has less free parameters. Assuming that 
progenitor bias is similar for massive cluster and field galax- 
ies, they have a Salpeter-like IMF, and they have the same 
metallicity and dust content at a given mass, we find that mas- 
sive field galaxies are younger than massive cluster galaxies, 
but that the difference is small at 4.1 ±2.0% (or 0.4 ±0.2 
Gyr). If a top-heavy IMF with x = is assumed for both field 
and cluster galaxies the age difference is 3.0 % ± 1.5 %. 

We note that the quoted errors do not include systematic er- 
rors in the comparison of field and cluster galaxies, which are 
difficult to quantify. Possible sources of such errors are dif- 
ferences in dynamical structure that deviate from pure homol- 
ogy, selection effects induced by the magnitude limits of the 
field samples (although these effects are more relevant for low 
mass galaxies; see Treu et al. 2005, van der Wei et al. 2005), 
a difference in n and/or A (which would reflect differences in 
the IMF or the metallicity), and a difference in the amount of 
progenitor bias. We tested the effect of a difference in pro- 
genitor bias by assuming that cluster early-type galaxies are 



not affected by it whereas the M /L ratios of field early-type 
galaxies need to be corrected by -0. 1 x z. In that case the age 
difference increases by a factor of two, to « 9 %. The im- 
portance of progenitor bias is bounded by the observed scat- 
ter in the color-magnitude relation of early-type galaxies (van 
Dokkum & Franx 2001). In clusters this scatter remains low 
all the way to z ^ 1.3 (Mei et al. 2006), but it is not yet clear 
whether this also applies to massive field early-type galaxies 
(see, e.g., Ferreras et al. 2005). 

The small age difference we find is consistent with most, 
but not all, previous studies. It is consistent with the high red- 
shift field FP studies of Treu et al. (2005b) and van der Wei 
et al. (2005), the analysis of a sample of gravitational lenses 
by Rusin & Kochanek (2005), and with studies of the local 
Mg2 - cr relation and FP by Bernardi et al. (1998, 2003a, 
2006). However, other studies find larger age differences (al- 
ways in the sense that field galaxies are younger than cluster 
galaxies). For nearby massive early-type galaxies, Thomas 
et al. (2005) and Clemens et al. (2006) find age differences of 
1.5-2 Gyr from complex models which include age, metal- 
licity, a-enhancement, and (in the case of Clemens et al.) 
Carbon-enhancement as free parameters. Such large age dif- 
ferences are inconsistent with our analysis at > 3(7. Perhaps 
most importantly, our results are not in agreement with the 
recent study of di Serego Alighieri et al. (2006), who com- 
pare the M /L ratios of field galaxies at z 1 from Treu et al. 
(2005b) and di Serego Alighieri et al. (2005) to that of galax- 
ies in two clusters with mean redshift (z) = 0.86 from J0r- 
gensen et al. (2006). For galaxies with masses > 10^' Mq di 
Serego Alighieri et al. (2006) find a very large age difference 
of 3.5 - 4 Gyr (their Fig. 4). The source of the discrepancy 
between their results and ours is, at least in part, due to the 
cluster data that are used in the comparison to field galaxies. 
As discussed in AppendixlAlthe M /L ratios of the galaxies in 
the J0rgensen et al. (2006) study need to be corrected down- 
ward by a factor 1 +z. Applying this offset to the (z) = 0.86 
cluster galaxies in Figs. 1 and 2 of di Serego Alighieri et al. 
(2006) brings them in line with the z ^ 1 field galaxies.^ 

The small age difference between massive field and cluster 
early-type galaxies that we find is remarkable in the context 
of "standard" hierarchical models, which had predicted that 
early-type galaxies in clusters have much older stellar popu- 
lations than those in the general field (e.g., Baugh, Cole, & 
Frenk 1996; Kauffmann 1996; Kauffmann & Chariot 1998). 
Specifically, as shown in van Dokkum et al. (2001), the mod- 
els of Kauffmann et al. (1999) and Diaferio et al. (2001) pre- 
dicted a systematic offset between field and cluster galaxies 
of ^ 0.25 in \og{M/LB) at all redshifts, clearly at odds with 
observations. The reason for this environmental dependence 
in these models is that gas cooling and subsequent star for- 
mation continues as long as a galaxy is the central galaxy in 
its halo. Star formation terminates when a galaxy becomes a 
satellite in a larger halo, which naturally leads to a large pop- 
ulation of "red and dead" galaxies in clusters and blue star 
forming galaxies in the field. This inability of standard mod- 
els to produce red field galaxies is well known, and among 
proposed solutions is to prevent cooling by postulating that 
the gas is heated by a central active galactic nucleus (AGN) 
(e.g.. Bower et al. 2006; Kang, Jing, & Silk 2006; Croton et 
al. 2006). The recent study of De Lucia et al. (2006) incor- 

' See also the en'atum published by di Serego Alighieri et al. (2006) in 
response to our discovery of this discrepancy. Their revised conclusions are 
consistent with those presented here. 
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porates such AGN feedback in a very large cosmological sim- 
ulation. In their simulation, the median star formation epoch 

~ 2.6 for elliptical galaxies with masses M > 10" Mq, in 
good agreement with our results. Furthermore, judging from 
their Fig. 7 the difference in age between ellipticals in halos 
of IO'-'M© and 10'^ is only 0.7 Gyr, consistent with 
our study. We note that this success does not necessarily im- 
ply that the models can reproduce the observed evolution of 
the FP. Building on the semi-analytical models of Baugh et 
al. (2005) and Bower et al. (2006), Almeida, Baugh, & Lacey 
(2006) find that evolution in the radii of early-type galaxies 
compensates for the evolution in their M/L ratios, such that 
the zeropoint of the FP is approximately constant with red- 
shift - inconsistent with the observations. 

Taken at face value, the age difference between field and 
cluster galaxies is ~ 0.4 ±0.2 Gyr, which should be fairly eas- 
ily detectable at z > 2. Interestingly, this number agrees rea- 
sonably well with the age difference that Steidel et al. (2005) 



find comparing Lyman break galaxies in the general field to 
those in a proto-cluster at z = 2.3. It is also qualitatively con- 
sistent with the finding that red galaxies at z ~ 2.5 cluster 
more strongly than blue galaxies (Quadri et al. 2006). 



We thank Arjen van der Wei and Brad Holden for their 
sleuthing, and the referee, Inger J0rgensen, for her comments 
and generous sharing of unpublished data. Support from 
NASA grants HF 01 126.01-99A, HST AR-09541.01-A, and 
LTSA NNG04GE12G, and from National Science Foundation 
grant NSF CAREER AST 04-49678 is gratefully acknowl- 
edged. The authors wish to recognize and acknowledge the 
very significant cultural role and reverence that the summit of 
Mauna Kea has always had within the indigenous Hawai'ian 
community. We are most fortunate to have the opportunity to 
conduct observations from this mountain. 



REFERENCES 



Almeida, C, Baugh, C. M., & Lacey, C. G. 2006, MNRAS, submitted 

I astro-pli/0608544 1 
Baugn, L,. M., uoie, S., & Frenic, C. S. 1996, MNRAS, 283, 1361 
Baugh, C. M., Lacey, C. G., Frenk, C. S., Granato, G. L., Silva, L., Bressan, 

A., Benson, A. J., & Cole, S. 2005, MNRAS, 356, 1191 
Beers, T. C, Flynn, K., & Gebhardt, K. 1990, AJ, 100, 32 
Bell, E. F, Naab, T., Mcintosh, D. H., Somerville, R. S., Caldwell, J. A. R., 

Barden, M., Wolf, C, Rix, H.-W., et al. 2006, ApJ, 640, 241 
Bemardi, M., Nichol, R. C, Sheth, R. K., Miller, C. J., & Brinkmann, J. 2006, 

AJ, 131, 1288 

Bernai'di, M., Renzini, A., da Costa, L. N., Wegner, G., Alonso, M. V., 
Pellegrini, R S., Rite, C, & Willmer, C. N. A. 1998, ApJ, 508, L143 

Bernai'di, M., Sheth, R. K., Annis, J., Buries, S., Eisenstein, D. J., Finkbeiner, 
D. R, Hogg, D. W., Lupton, R. H., et al. 2003a, AJ, 125, 1866 

Bemardi, M., Sheth, R. K., Annis, J., Buries, S., Eisenstein, D. J., Finkbeiner, 
D. R, Hogg, D. W., Lupton, R. H., et al. 2003b, AJ, 125, 1817 

Bessell, M. S. 1990, PASP, 102, 1181 

Blakeslee, J. P., Holden, B. P., Franx, M., Rosati, P., Bouwens, R. J., Demarco, 

R., Ford, H. C, et al. 2006, ApJ, 644, 30 
Bower, R. G., Benson, A. J., Malbon, R., Helly, J. C, Frenk, C. S., Baugh, 

C. M., Cole, S., & Lacey, C. G. 2006, MNRAS, 370, 645 
Boylan-Kolchin, M., Ma, C.-P, & Quataert, E. 2006, MNRAS, 534 
Bruzual, G. & Chariot, S. 2003, MNRAS, 344, 1000 
Buser, R., & Kurucz, R. L. 1978, A&A, 70, 555 

Cappellari, M., Bacon, R., Bureau, M., Damen, M. C, Davies, R. L., de 
Zeeuw, R T., Emsellem, E., Falcon-Barroso, J., et al. 2006, MNRAS, 366, 
1126 

Chabrier, G. 2003, PASP, 115, 763 

Clemens, M. S., Bressan, A., Nikolic, B., Alexander, P., Annibali, F., & 

Rampazzo, R. 2006, MNRAS, 649 
Coleman, G. D., Wu, C.-C, & Weedman, D. W. 1980, ApJS, 43, 393 
Croton, D. J, Springel, V., White, S. D. M., De Lucia, G., Frenk, C. S., Gao, 

L., Jenkins, A., Kauffmann, G., et al. 2006, MNRAS, 365, 11 
Daddi, E., Rottgering, H. J. A., Labbe, I., Rudnick, G., Franx, M., Moorwood, 

A. F M., Rix, H. W., van der Werf, P P, & van Dokkum, P G. 2003, ApJ, 

588, 50 

Daddi, E., Cimatti, A., Renzini, A., Vernet, J., Conselice, C, Pozzetti, L., 

Mignoh, M., Tozzi, P, et al. 2004, ApJ, 600, L127 
De Lucia, G., Springel, V., White, S. D. M., Croton, D., & Kauffmann, G. 

2006, MNRAS, 366, 499 
Diaferio, A., Kauffmann, G., Balogh, M. L., White, S. D. M., Schade, D., & 

Ellingson, E. 2001, MNRAS, 323, 999 
di Serego Alighieri, S., Lanzoni, B., & J0rgensen, I. 2006, ApJ, 647, L99 
di Serego Alighieri, S., Vernet, J., Cimatti, A., Lanzoni, B., Cassata, P., Ciotti, 

L., Daddi, E., Mignoh, M., et al. 2005, A&A, 442, 125 
Djorgovski, S. & Davis, M. 1987, ApJ, 313, 59 

Dressier, A., Oemler, A. J., Couch, W. J., Small, I., Elhs, R. S., Barger, A., 

Butcher, H., Poggianti, B. M., et al. 1997, ApJ, 490, 577 
Faber, S. M., Dressier, A., Davies, R. L., Burstein, D., Lynden-Bell, D., 

Terlevich, R., Wegner, G. 1987, in Nearly Normal Galaxies, ed. S. M. 

Faber (New York: Springer), 175 
Ferreras, I., Lisker, T., Carollo, C. M., Lilly, S. J., Mobasher, B. 2005, ApJ, 

635, 243 

Forster Schreiber, N. M., van Dokkum, P. G., Franx, M., Labbe, I., Rudnick, 

G., Daddi, E., lllingworth, G. D., Kriek, M., et al. 2004, ApJ, 616, 40 
Franx, M. 1993, PASP, 105, 1058 

Franx, M., Labbe, I., Rudnick, G., van Dokkum, P. G., Daddi, E., Forster 
Schreiber, N. M., Moorwood, A., Rix, H., et al. 2003, ApJ, 587, L79 

Freedman, W. L., Madore, B. F., Gibson, B. K., Ferrarese, L., Kelson, D. D., 
Sakai, S., Mould, J. R., et al. 2001, ApJ, 553, 47 

Fritz, A., Ziegler, B. L., Bower, R. G., Small, I., & Davies, R. L. 2005, 
MNRAS, 358, 233 



Fukugita, M., Ichikawa, T., Gunn, J. E., Doi, M., Shimasaku, K., & Schneider, 

D. P 1996, AJ, 111, 1748 
Gomez, P L., Hughes, J. P, & Birkinshaw, M. 2000, ApJ, 540, 726 
Gonzalez-Garci'a, A. C. & van Albada, T. S. 2003, MNRAS, 342, L36 
Grazian, A., Fontana, A., Moscardini, L., Salimbeni, S., Menci, N., 

Giallongo, E., de Santis, C, Gallozzi, S., et al. 2006, A&A, 453, 507 
Guzman, R., Koo, D. C, Faber, S. M., Ilhngworth, G. D., Takamiya, M., 

Kron, R. G., & Bershady, M. A. 1996, ApJ, 460, L5 
Holden, B. P., van der Wei, A., Franx, M., lllingworth, G. D., Blakeslee, J. P., 

van Dokkum, P, Ford, H., Magee, D., et al. 2005, ApJ, 620, L83 
Holden, B. P., Franx, M., lllingworth, G. D., Postman, M., Blakeslee, J. P., 

Homeier, N., Demarco, R., et al. 2006, ApJ, 642, L123 
Holtzman, J. A., Bunows, C. J., Casertano, S., Hester, J. J., Trauger, J. T., 

Watson, A. M., & Worthey, G. 1995, PASP, 107, 1065 
Hudson, M. J., Lucey, J. R., Smith, R. J., Schlegel, D. J., & Davies, R. L. 

2001, MNRAS, 327, 265 
J0rgensen, I., Bergmann, M., Davies, R., Barr, J., Takamiya, M., & Crampton, 

D. 2005, AJ, 129, 1249 
J0rgensen, I., Chiboucas, K., FUnt, K., Bergmann, M., Barr, J., & Davies, R. 

2006, ApJ, 639, L9 
J0rgensen, I., Franx, M., Hjorth, J., & van Dokkum, P G. 1999, MNRAS, 

308, 833 

J0rgensen, 1., Franx, M., & Kja;rgaard, P 1992, A&AS, 95, 489 
J0rgensen, 1., Franx, M., & Kjajrgaai'd, P 1995a, MNRAS, 273, 1097 
J0rgensen, 1., Franx, M., & Kja;rgaard, P 1995b, MNRAS, 276, 1341 
— . 1996, MNRAS, 280, 167 

Kang, X., Jing, Y. P, & Silk, J. 2006, ApJ, 648, 820 

Kauffmann, G. 1996, MNRAS, 281, 487 

Kauffmann, G. & Chariot, S. 1998, MNRAS, 297, L23 

Kauffmann, G., Colberg, J. M., Diaferio, A., & White, S. D. M. 1999, 

MNRAS, 303, 188 
Kelson, D. D. 2003, PASP, 115, 688 

Kelson, D. D., lllingworth, G. D., van Dokkum, P G., & Franx, M. 2000a, 

ApJ, 531, 137 
— . 2000b, ApJ, 531, 159 
— . 2000c, ApJ, 531, 184 

Kelson, D. D., van Dokkum, P. G., Franx, M., Ilhngworth, G. D., & Fabricant, 

D. 1997, ApJ, 478, L13 
Knudsen, K. K., van der Werf, P., Franx, M., Forster Schreiber, N. M., van 

Dokkum, P. G., lllingworth, G. D., Labbe, I., Moorwood, A., et al. 2005, 

ApJ, 632, L9 

Kriek, M., van Dokkum, P., Franx, M., Forster Schreiber, N., Gawiser, E., 
lllingworth, G., Labbe, I., Mai-chesini, D., et al. 2006, ApJ, 645, 44 

Kriek, M., van Dokkum, P., Franx, M., Quadri, R., Gawiser, E., Herrera, D., 
lllingworth, G., Labbe, 1., et al. 2006, ApJ, in press ( astro-ph/0608446 i 

Krist, J. 1995, in ASP Conf. Ser. 77: Astronomical Data Analysis software 
and Systems IV, Vol. 4, 349 

Kroupa, P 2001, MNRAS, 322, 231 

Labbe, 1., Franx, M., Rudnick, G., Schreiber, N. M. F, Rix, H., Moorwood, 
A., van Dokkum, P G., van der Werf, P, et al. 2003, AJ, 125, 1 107 

Labbe, 1., Huang, J., Franx, M., Rudnick, G., Barmby, P., Daddi, E., van 
Dokkum, P G., Fazio, G. G., et al. 2005, ApJ, 624, L81 

Larson, R. B. 1998, MNRAS, 301, 569 

Le Borgne, J. F, Pello, R., & Sanahuja, B. 1992, A&AS, 95, 87 
Lubin, L. M. & Sandage, A. 2001, AJ, 122, 1084 
Maraston, C. 2005, MNRAS, 362, 799 

Mei, S., Holden, B. P., Blakeslee, J. P., Rosati, P., Postman, M., Jee, M. J., 

Rettura, A., et al. 2006, ApJ, 644, 759 
Moran, S. M., Elhs, R. S., Treu, T., Small, I., Dressier, A., Coil, A. L., & 

Smith, G. P 2005, ApJ, 634, 977 
Nagamine, K., Cen, R., Hernquist, L., Ostriker, J. P., & Springel, V. 2005, 

ApJ, 627, 608 



16 



Star Formation Epoch of Early-Type Galaxies 



Nagashima, M., Lacey, C. G., Okamoto, T., Baugh, C. M., Frenk, C. S., & 

Cole, S. 2005, MNRAS, 363, L31 
Pahre, M. A., Djorgovski, S. G., & de Carvalho, R. R. 1996, ApJ, 456, L79+ 
Papovich, C., Moustakas, L. A., Dickinson, M., Le Floc'h, E., Rieke, G. H., 

Daddi, E., Alexander, D. M., Bauer, E, et al. 2006, ApJ, 640, 92 
Quadri, R., van Dokkum, P., Gawiser, E., Eranx, M., Marchesini, D., Lira, P., 

Rudnick, G., Herrera, D., et al. 2006, ApJ, in press (astro-ph/0606330l 
Reddy, N. A., Erb, D. K., Steidel, C. C, Shapley, A. E., Adelberger, K. L., & 

Pettini, M. 2005, ApJ, 633, 748 
Robertson, B., Cox, T. J., Hernquist, L., Franx, M., Hopkins, P. E, Martini, 

P, & Springel, V. 2006, ApJ, 641, 21 
Rubin, K. H. R., van Dokkum, P. G., Coppi, P., Johnson, O., Forster 

Schreiber, N. M., Franx, M., & van der Wert', P 2004, ApJ, 613, L5 
Rusin, D., Kochanek, C. S., Ealco, E. E., Keeton, C. R., McLeod, B. A., 

Impey, C. D., Lehar, J., et al. 2003, ApJ, 587, 143 
Rusin, D., & Kochanek, C. S. 2005, ApJ, 623, 666 
Salpeter, E. E. 1955, ApJ, 121, 161 

Schlegel, D. J., Finkbeiner, D. P, & Davis, M. 1998, ApJ, 500, 525 
Schneider, R., Ferrara, A., Natarajan, R, & Omukai, K. 2002, ApJ, 571, 30 
Sirianni, M. J. J., Bem'tez, N., Blakeslee, J. P., Martel, A. R., Meurer, G., 

Clampin, M., de Marchi, G., Ford, H. C, et al. 2005, PASP, 1 17, 1049 
Smail, I., Dressier, A., Couch, W. J., Ellis, R. S., Oemler, A. J., Butcher, H., 

& Sharpies, R. M. 1997, ApJS, 1 10, 213 [MORPHS] 
Steidel, C. C, Adelberger, K. L., Shapley, A. E., Erb, D. K., Reddy, N. A., & 

Pettini, M. 2005, ApJ, 626, 44 
Thomas, D., Maraston, C, Bender, R., & Mendes de Oliveira, C. 2005, ApJ, 

621, 673 

Tinsley, B. M. 1980, Fundamentals of Cosmic Physics, 5, 287 
Tonry, J. & Davis, M. 1979, AJ, 84, 1511 

Tran, K.-V. H., van Dokkum, P, Franx, M., Illingworth, G. D., Kelson, D. D., 

& Schreiber, N. M. F 2005a, ApJ, 627, L25 
Tran, K.-V. H., van Dokkum, P., Illingworth, G. D., Kelson, D., Gonzalez, A., 

& Franx, M. 2005b, ApJ, 619, 134 
Treu, T, Ellis, R. S., Liao, T. X., & van Dokkum, R G. 2005a, ApJ, 622, L5 
Treu, T., ElHs, R. S., Liao, T. X., van Dokkum, R G., Tozzi, R, Coil, A., 

Newman, J., Cooper, M. C, et al. 2005b, ApJ, 633, 174 
Treu, T., Stiavelli, M., Casertano, S., Moller, P, & Berlin, G. 1999, MNRAS, 

308, 1037 



— . 2002, ApJ, 564, L13 

van der Marel, R. P., & van Dokkum, P. G. 2006a, ApJ, submitted 

[vdMvD06a] (astro-ph/0611571 1 
van der Marel, R. P., & van Dokkum, P. G. 2006b, ApJ, submitted 

[vdMvD06b] I astro-ph/061 1577 I 
van der Wei, A., Franx, M., van Dokkum, R G., & Rix, H.-W. 2004, ApJ, 

601, L5 

van der Wei, A., Franx, M., van Dokkum, P. G., Rix, H.-W., Illingworth, 

G. D., & Rosati, P 2005, ApJ, 631, 145 
van de Ven, G., van Dokkum, R G., & Franx, M. 2003, MNRAS, 344, 924 
van Dokkum, P G. 2001, PASP, 113, 1420 
— . 2005, AJ, 130, 2647 

van Dokkum, P G. & Ellis, R. S. 2003, ApJ, 592, L53 

van Dokkum, P. G., Forster Schreiber, N. M., Eranx, M., Daddi, E., 

Illingworth, G. D., Labbe, I., Moorwood, A., Rix, H., et al. 2003, ApJ, 

587, L83 

van Dokkum, P G. & Franx, M. 1996, MNRAS, 281, 985 
— . 2001, ApJ, 553,90 

van Dokkum, P G., Franx, M., Kelson, D. D., & Illingworth, G. D. 1998a, 

ApJ, 504, L17 
— . 2001, ApJ, 553, L39 

van Dokkum, P. G., Eranx, M., Kelson, D. D., Illingworth, G. D., Fisher, D., 

& Fabricant, D. 1998b, ApJ, 500, 714 
van Dokkum, P. G., Quadri, R., Marchesini, D., Rudnick, G., Franx, M., 

Gawiser, E., Herrera, D., Wuyts, S., et al. 2006, ApJ, 638, L59 
van Dokkum, P G. & Stanford, S. A. 2003, ApJ, 585, 78 
Webb, T. M. A., van Dokkum, P., Egami, E., Fazio, G., Franx, M., Gawiser, 

E., Hen-era, D., Huang, J., et al. 2006, ApJ, 636, L17 
Worthey, G. 1994, ApJS, 95, 107 

Worthey, G., Faber, S. M., & Gonzalez, J. J. 1992, ApJ, 398, 69 

Wuyts, S., van Dokkum, R G., Kelson, D. D., Eranx, M., & Illingworth, G. D. 

2004, ApJ, 605, 677 
Yan, H., Dickinson, M., Eisenhardt, P. R. M., Ferguson, H. C, Grogin, N. A., 

Paolillo, M., Chary, R.-R., Casertano, S., et al. 2004, ApJ, 616, 63 
Ziegler, B. L., Bower, R. G., Smail, I., Davies, R. L., & Lee, D. 2001, 

MNRAS, 325, 1571 



APPENDIX 

LITERATURE DATA FOR CLUSTER GALAXIES 

In addition to the three clusters discussed in this paper we used eleven additional distant clusters in the analysis. We also 
included the Coma cluster and a sample of SDSS galaxies in the nearby universe. Here we describe the various transformations 
that were applied to bring the published data to our system. We also describe sources of systematic uncertainty for each cluster 
In most cases morphological information is available; only early-type galaxies with masses > 10" were included in the 
calculation of the M /L offsets. 

Coma (Abell 1656) 

The value of cq in Eq. [8]is set to that of the nearby Coma cluster This choice is arbitrary, as adding a constant to all values 
of Alog{M /Lb) does not change our results in any way. We use the sample of J0rgensen et al. (1996). Structural parameters 
measured in the B band are given in J0rgensen et al. (1995a), and velocity dispersions corrected to a 1 "7 radius aperture are listed 
in J0rgensen et al. (1995b). The listed surface brightnesses are corrected from the average brightness within the effective radius 
{fi)e to that at the effective radius: fi^ = (/^)e+ 1-393. Effective radii are converted from arcseconds to kpc by adopting a Hubble 
flow velocity Vflow = 7376 ± 223 km s"' (see vdMvD06b). 

Restricting the sample to early-type galaxies with M > 10" M0 we find cq = -9.626. The value of Alog(M /Lb) is zero by 
definition, with a random uncertainty of ±0.014. The systematic uncertainty in log{M/LB) is a combination of several factors. 
Zeropoint uncertainties in the B band photometry give ±0.010 (J0rgensen, Franx, & Kjsergaard 1992), and the uncertainty in 
Vflow implies ±0.013. A comparison of studies by different authors gives ±0.02 (see Hudson et al. 2001, and vdMvD06b); this 
uncertainty includes (and may be dominated by) systematic differences in the methodology for deriving structural parameters and 
velocity dispersions. As discussed in vdMvD06b several other systematic uncertainties cancel in the comparison to the distant 
clusters. For example, the uncertainty in the Hubble constant cancels because we are only concerned with the evolution of the 
M /Lb ratio and not with the absolute value. Also, as discussed in van Dokkum & Franx (1996) our methodology for measuring 
velocity dispersions of galaxies in distant clusters mimics that of J0rgensen et al. in the nearby universe. The combined systematic 
error is 0.026, and assuming that the random and systematic errors can be added in quadrature we obtain /S.\og{M / Lb) = 0.000 ± 
0.029. 

The Bernardi Sample 

Bernardi et al. (2003a) have studied the fundamental plane relation of galaxies in the SDSS. This sample has several advantages 
over previous studies: it is large (the total sample comprises 8661 objects), does not suffer from uncertainties due to peculiar 
velocities (both because they average out over the large volume probed by the data and because of the relatively high typical 
redshift of 0.1), has very homogeneous spectroscopy and multi-band photometry, probes a large range of environments, and 
spans a range in redshift so that in principle evolutionary effects can be studied within the sample. An important drawback 
specific to this sample is that the magnitude-selection causes significant biases which need to be taken into account (Bernardi 
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et al. 2003a); such biases are less important (although not absent) for cluster samples as all galaxies are at the same distance. 
Furthermore, as noted by, e.g., Cappellari et al. (2006) the coefficients a and b that Barnardi et al. derive do not appear to be 
consistent with most other studies of the FP; the cause for this discrepancy is unclear at present. 

Velocity dispersions and structural parameters in the rest-frame g' band were obtained from Table 3b of Bernardi et al. (2003b). 
Effective radii were converted to our cosmology. Rest-frame g' band surface brightnesses were converted to the rest-frame B band 
using the transformation 

B = g' + 0A4-(g'-r') + 0.n, (Al) 

with B on the Vega system and g' and r' on the AB system. For consistency this transformation was derived using the same 
method and B band filter curve as Eq. [T] with the only difference that the redshift was set to zero (i.e., the g' and r' filters 
were not redshifted). Fukugita et al. (1996) give B, = g' +QA2(g' - r') + Q.22; for the typical colors of galaxies in the sample 
this transformation is consistent with ours to 0.04 mag. The g' -r' colors cannot be obtained by simply taking the difference 
{lig')e- {lJ.r')e as the surface brightness was evaluated within a different radius in each band. Therefore we first corrected the 
surface brightnesses to a common radius using the measured effective radii of the galaxies (see van der Wei et al. 2005). Finally 
we applied the transformation /if = {fJ.)e+ 1.393. We note that the uncertainty in this procedure is ultimately determined by the 
difference between the observed band and the final rest-frame band. In this case, the observed band is the g' band and the final 
rest-frame band is the B band, and at z ^ 0.1 these are very well matched. 

Bernardi et al. (2003b) corrected their velocity dispersions to an aperture of radius OAlSr^, which for almost all galaxies is 
significantly smaller than the 1 ."7 aperture at the distance of Coma that we adopted. We first retrieved the dispersions as measured 
through the 1 ."5 SDSS fibers using the measured effective radii (in arcsec) and Eq. 1 of Bernardi et al. (2003b). Then we corrected 
these measured dispersions to a 1 ."7 aperture at the distance of Coma using the redshifts of the galaxies and the prescription of 
J0rgensen et al. (1995b). 

FP offsets were determined for all galaxies in the sample with local density «den > 20, using a = 1 .20, b = -0.83, and cq = -9.626. 
Following Bernardi et al. (2003b) (their Fig. 10) and Bernardi et al. (2003a) (their Fig. 9) galaxies with Wden = 100 were 
excluded. As discussed extensively in Bernardi et al. (2003a) average M/L ratios inferred from these data need to be corrected 
for selection effects. Fitting to the valu es lis ted in their Fig. 7 gives Aseiectlog(M/Lg') = -0.289z and Aseiectlog(M/Lr') = — 0.208z. 
Extrapolating to the B band using Eq. lAll gives Aseiectlog(M/LB) = -0.320z. Bernardi et al. determined the effect for the full 
sample, and not for the subsample of galaxies with M > 10" Mq that is relevant here. Selection effects will be less important for 
the massive galaxies, but it is not clear by how much. We assume that the effect is half that of the full sample, and conservatively 
assign an uncertainty which encompasses the full range of possibilities: 

Alog(M/LB,eoiT) = Alog(M/LB) + (0.16±0.16)z (A2) 

Given the uncertainty in the redshift evolution within the Bernardi et al. sample we evaluate the average M /Lb offset at one 
redshift only. We chose z ~ 0.10, as selection effects do not play a large role at that redshift and the local galaxy densities 
«den are not well determined for z < 0.08 (Bernardi et al. 2003b). In practice we select the 171 galaxies with M > 10" M0, 
20 < Hden < 100, and 0.075 < z < 0.14. The average redshift of this sample is z = 0.109. 

The value of the M/L offset is listed in Table 3. The random uncertainty is only ±0.008. The uncertainty in the photometric 
transformation is taken to be the difference between the Fukugita et al. (1996) transformation and ours (±0.016). The uncertainty 
due to selection effects is 0.16 x 0.109 = 0.017. Bernardi et al. (2003b) estimate that the systematic uncertainty in the velocity 
dispersions is ~ 3 %, implying ±0.019 in logM/Ls- Adding all uncertainties in quadrature gives ±0.030 for the total error in the 
M/L offset. 

For the analysis in § 16.21 we also determined the M/L offset in low-density regions, selecting all 1519 galaxies with M > 
10" M0, 0.075 < z < 0.14, and nden < 10. The average redshift of this sample is 0.1093. The offset (corrected for selection 
effects) is listed in Table 3. The random error is 0.003 and the systematic error is the same as derived for the high-density 
sample. The difference between the high- and low-density offset is 0.031 in log{M /Lb), or 0.08 magnitudes if it is interpreted as 
a difference in surface brightness at fixed r^. and a. This difference appears to be consistent with the trends shown in Fig. 9 of 
Bernardi et al. (2003a) for this redshift range. 

Abell 2218 

Abell 2218 is a well-studied very rich cluster at z = 0.1756 (Le Borgne, Pello, & Sanahuja 1992). Its FP has been analyzed 
in two independent studies, J0rgensen et al. (1999) and Ziegler et al. (2001). J0rgensen et al. (1999) give velocity dispersions 
and two sets of structural parameters, one derived from HST images and one derived from ground-based data. Their velocity 
dispersions have already been corrected to the 1 ."7 aperture at the distance of Coma. The ground-based structural parameters 
were corrected from the observed band to the rest- frame B band using the listed V -I colors: 

B, = /,.+ 1.29(y-/c) + 0.18, (A3) 

and corrected from {^)e to ^g. The structural parameters measured from the HST F702W images were transformed by J0rgensen 
et al. to the 7^. band, and we use Eq. IA3l to transform the surface brightness to rest-frame B. 

Ziegler et al. (2001) obtained independent spectroscopy for galaxies in Abell 2218, and give velocity dispersions which were 
corrected to the same 1 "7 radius aperture at the distance of Coma as we use. They used the same HST imaging as J0rgensen 
et al. (1999), but derive their own structural parameters from the images. We transformed their listed rest-frame Gunn r surface 
brightnesses from Lq pc"^ to mag arcsec"^ using their Eq. 7, and then converted them back to observed magnitudes using their 
Eq. 1. In this conversion we used the listed V and 7^. magnitudes after applying extinction corrections of 0.083 mag in V and 
0.048 mag in Ic- The 7^. surface brightnesses were transformed from to and corrected to rest-frame B using Eq. lA3l 
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Combining all information there are three measurements of the FP: 1) dispersions and HST structural parameters from J0r- 
gensen et al. (1999); 2) dispersions and ground-based structural parameters from J0rgensen et al. (1999); and 3) dispersions 
and HST structural parameters from Ziegler et al. (2001). The two sets of dispersions and the two sets of ground-based data are 
independent. We determined the M/L offset for all three cases and compared the results. The offsets agree very well: differences 
are approximately 0.02 in \og{M/LB). Rather than take the average of the three determinations we use the zeropoint from mea- 
surement 2): it falls between the other two and it is the one used in the analysis of the J0rgensen et al. (1999) paper. Apart from 
a systematic uncertainty of 0.02 due to the differences between the three determination there is also an uncertainty introduced 
by the fact that all structural parameters were measured in redder bands than the redshifted B band. This uncertainty stems from 
color gradients and the dependence of the FP parameters on passband, and is estimated at 0.05 mag. The combined systematic 
uncertainty is 0.029 in \og{M/LB). 

Abell665 

The FP in Abell 665 (z = 0.1829; Gomez, Hughes, & Birkinshaw 2000) was studied by J0rgensen et al. (1999). As for Abell 
2218 there are two sets of structural parameters, one from HST imaging and the other from ground-based imaging. Contrary to 
Abell 22 1 8 there are also two sets of V - color measurements : one directly measured from the ground and the other transformed 
from /?606-^8i4 HST colors. We applied the same transformations as for Abell 2218, and compared the FP relations derived 
from ground-based data to the FP derived from HST data. The difference is 0.02 in log(M/LB), consistent with the expected 
uncertainties due to the color transformations. The final offset was calculated from the HST measurements in J0rgensen et al. 
(1999). As for Abell 2218 the total systematic uncertainty is estimated at 0.029 in \og{M/LB). 

Abell 2390 

Fritz et al. (2005) present the FP relation in Abell 2390 at z = 0.228. We corrected the listed velocity dispersions to our fiducial 
aperture following J0rgensen et al. (1995b), using l."5 x 2."8 as the extraction aperture. The listed rest-frame Gunn r surface 
brightnesses were converted back to observed WFPC2 /g^ magnitudes using (/xgn),, = (/Xr,z)e-0.75 (see Fritz et al. 2005). We 
converted to /x^ and transformed to the (COSMIC camera) / band using 

/ = 7gi4-0.037(B-7)+0.007(B-7)^+0.00 (A4) 

(Holtzman et al. 1995). The B-I colors were determined from the listed B and / aperture magnitudes. These are not corrected for 
extinction; we applied corrections of 0.476 mag to B and 0.214 mag to I (see Fritz et al. 2005). Rest-frame B surface brightnesses 
were determined using 

Bz=7+0.47(B-/) + 0.51. (A5) 

Based on listed zeropoint uncertainties the uncertainty in the photometric transformations is estimated at 0.03 mag. 

In contrast to our own measurements of the FP in distant clusters no effort was made to determine parameters in the same way 
as the J0rgensen et al. studies at low redshift. There is therefore a systematic uncertainty of w 5 % in the velocity dispersions, 
which is caused by possible differences in the fitting region, choice of templates, and fitting methodology. This corresponds to 
an uncertainty of 0.03 1 in log(M/LB). Combined with a 0.05 mag uncertainty due to the transformations and the fact that the FP 
was determined in red rest-frame bands, we estimate that the total systematic uncertainty is 0.037 in log(M/La). 

CL 1358+62 

Kelson et al. (2000a, 2000b, 2000c) provide an extensive analysis of the FP in the cluster CL 1358+62 at z = 0.327 in the rest- 
frame V band. Listed velocity dispersions (Kelson et al. 2000b) are already corrected to our fiducial aperture. Listed rest-frame 
surface brightnesses and colors were back-corrected to the observed ones using the Eqs. in Kelson et al. (2000a). They were then 
transformed to rest-frame B using 

fi, = /8i4+1.09(/?606-/814) + 0.52 (A6) 

and transformed from (/i)e to /i,.. 

The Kelson et al. methodology is essentially identical to ours, and systematic uncertainties in, e.g., spectral continuum filtering 
cancel. Systematic uncertainties in the dispersions due to possible changes of the spectral templates with redshift are estimated 
at 2 %. Systematic errors in the photometric transformations are estimated at 0.03 mag. The combined systematic uncertainty is 
0.021 in log(M/LB). 

CL0024+16 

van Dokkum & Franx (1996) present the FP in the z = 0.391 cluster CL 0024+16. We use their sample rather than the much 
larger sample of Moran et al. (2005) because van Dokkum & Franx (1996) use identical techniques to ours, and systematic 
uncertainties in the velocity dispersions dominate over random uncertainties due to the limited sample size. We note that Moran 
et al. (2005) find that their data agree with those of van Dokkum & Franx (1996) to within a few percent, both for the velocity 
dispersions of individual objects in conmion between the two samples and for the offset in M/L ratio derived from the FP. 

Listed velocity dispersions have already been corrected to our fiducial aperture. The Usted /-band surface brightnesses were 
corrected to rest-frame B using 

B^=7+1.46(/?-/) + 0.57. (A7) 
The systematic uncertainty stems from the same sources as for CL 1358+62, and is estimated at 0.021 in logiM/Ls). 
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MS 2053-04 and MS 1054-03 

Wuyts et al. (2004) analyze the FP in the two clusters MS 2053-04 (z = 0.583) and MS 1054-03 (z = 0.832). The Wuyts et 
al. study followed initial studies of smaller samples in the two clusters by Kelson et al. (1997) and van Dokkum et al. (1998a) 
respectively. The listed structural parameters and velocity dispersions are already on our system. However, we update the 
transformations to rest-frame B, as the synthetic B filter curve used in Wuyts et al. (2004) is slightly different from the one used 
in the present study. The updated transformations are 

B, =ki4 + 0.38(7^606 - /814) + 0.93 (A8) 

for MS 2053-04 and 

B,=/8i4-0.05(«606-/814)+ 1.22 (A9) 

for MS 1054-03. These transformations are consistent with those given in Wuyts et al. (2004) to within « 0.04 mag for the 
typical colors of early-type galaxies in these clusters. The systematic uncertainty is taken to be the same as for CL 1358H-62. 

RXJOl 52.7-1 357 

Imaging and spectroscopic data of RX J0152-13 (z = 0.837) and RX J1226H-33 (z = 0.892) were obtained in the context of the 
Gemini/Z/ST Galaxy Cluster Project (J0rgensen et al. 2005). J0rgensen et al. (2006) discuss the FP in these clusters, finding that 
the most massive galaxies in these clusters are offset from the Coma FP by AlogM/L^ « -0.25 only. This offset is much smaller 
than had been found previously for the cluster MS 1054-03 at z = 0.83 (Wuyts et al. 2004), indicating significant cluster-to-cluster 
scatter at z ^ 1. J0rgensen et al. (2006) do not provide data for individual galaxies in these clusters, but for RX J0152-13 the 
FP can be constructed by combining velocity dispersions listed in Table 12 of J0rgensen et al. (2005) with structural parameters 
measured by Blakeslee et al. (2006). The velocity dispersions (CTcoit from J0rgensen et al. [2005]) are already on our system, and 
no further corrections are required. Effective radii from Blakeslee et al. were circularized by multiplying them by y/b/a, and 
converted to kpc. The listed magnitudes (!775,gfit in Blakeslee et al. [2006]) were converted to surface brightnesses on our system 
through the transformation 

= /775,gfit + 2.51og2 + 2.51og(7rr2)+ 1.393, (AlO) 

with re in arcsec. Next, surface brightnesses and colors were converted from AB to Vega magnitudes using the Sirianni et al. 
(2005) ACS zeropoints, and transformed to rest-frame B using 

B, = /775-0.35(/775-Z850)+ 1.21. (All) 

Finally, the M/L offset was derived using the same techniques as for the other clusters. We find AlogM/L^ = -0.449, much larger 
than the J0rgensen et al. (2006) offset. This difference is caused by a combination of two effects: small differences in methodology 
(e.g., a 0.05 mag difference between the measured Blakeslee et al. [2005] and J0rgensen et al. [2006] (775 surface brightnesses), 
and an error in the transformation from observed to rest-frame magnitudes. The rest-frame magnitudes in J0rgensen et al. (2006) 
are too faint by a factor of 1 +z.^ 1. J0rgensen kindly determined the offset for galaxies with M > 10" Mq in RX J0152-13 using 
the J0rgensen et al. (2006) data with our transformation from SDSS /' and z' magnitudes to rest-frame B, and our zeropoint for 
the Coma cluster. The offset is -0.475. This value is consistent with ours to < 0.01, when the systematic difference of 0.05 mag 
between the Blakeslee et al. and J0rgensen et al. methodology is taken into account. Sources of systematic error are the same 
as for, e.g., Abell 2218 and Abell 665, but we added 0.02 in quadrature to reflect the difference between the Blakeslee et al. and 
J0rgensen et al. structural parameters. 

RXJ1226.9+3332 

This z = 0.892 cluster is discussed in J0rgensen et al. (2006) together with RX J0152-13. No individual measurements of 
galaxies are available, but 1. J0rgensen determined the offset in M/Lg ratio using our transformations. Coma zeropoint, and 
mass selection. A small correction of 0.05 mag was applied to bring the data to the same system as the Blakeslee et al. (2006) 
photometry (see § IA.9I ). The offset is consistent with the value that is implied by the J0rgensen et al. (2006) study, after correcting 
their data points by a factor 1 + z (§ IA.9t . Sources of systematic uncertainty are the correction to the Blakeslee et al. (2006) 
system (0.02 in logM/L^), systematic uncertainty in the dispersions (0.025), the transformation to rest-frame B (0.012), and the 
photometric zeropoints of the ground-based photometry (0.02). The total systematic uncertainty is 0.040. The random uncertainty 
is 0.039 in logM/Lg, which brings the total error to 0.056. 

RDCS 1252.9-2927 

The FP in the z = 1.237 cluster RDCS 1252 was determined by Holden et al. (2005) using four galaxies, all of which have 
M > 10" Mq. Listed velocity dispersions are already on our system. The observed zsso surface brightnesses were converted from 
{fj,)e to He and converted to rest-frame B using 

B, = Z850 - 0.22(/775 - 7) + 0.99 , (A 1 2) 

where zsso, '775, and J are on the AB system (see Holden et al. 2005). This transformation is slightly different from the one used 
in Holden et al. (2005), due to the use of a different derivation procedure.^ For the typical colors of galaxies in RDCS 1252 the 
difference in the transformation results in a difference of < 0.05 mag in B~. The determination of velocity dispersions from the 
near-UV spectral region may cause systematic errors of 5 % (van Dokkum & Stanford 2003). The total systematic uncertainty is 
0.043 in logiM/Ls). We note that for this cluster and for RDCS 0848 the random uncertainty exceeds the systematic uncertainty. 

^ See the erratum published by J0rgensen et al. (2006) in response to our discovery of this discrepancy. 

' Note that most of the apparent difference between Eq. A12 and the transformation given in Holden et al. (2005) results from the fact that in their equation 
the zsso surface brightness is on the AB system and the 1775 —J color is on the Vega system. 
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RDCS 0848+4453 

The FP for three galaxies in the z = 1.276 cluster RDCS 0848 was presented in van Dokkum & Stanford (2003). The listed 
velocity dispersions are on our system. The F160W surface brightnesses were corrected to rest-frame B using 

B, = //,6o + 0.46(/8i4-i/i6o)+ 1.92. (A13) 

This transformation is slightly different from Eq. 3 in van Dokkum & Stanford (2003), as we use a slightly different synthetic 
B filter in the present study. For the typical colors of galaxies in RDCS 0848 the two transformations are consistent to < 0.05 
mag. The systematic uncertainty is a combination of the uncertainty derived above for CL 1358H-62 and a 5 % uncertainty in the 
dispersions, which originates from the use of the near-UV spectral region. The adopted uncertainty in logiM/Ls) is 0.038. 

LITERATURE DATA FOR FIELD GALAXIES 

van Dokkum et al. (2001 ) and van Dokkum & Ellis (2003) 

These studies give structural parameters and velocity dispersions for morphologically-selected early-type galaxies, van 
Dokkum et al. (2001) give results for 18 galaxies at 0. 15 < z < 0.55 in the foreground of the rich clusters MS 2053-04 (z = 0.58) 
and MS 1054-03 (z = 0.83). van Dokkum & ElUs studied ten galaxies at 0.56 < z < 1 . 10 in the Hubble Deep Field North (HDF- 
N). The methodology for deriving velocity dispersions and structural parameters in these studies is identical to ours. Listed 
surface brightnesses were transformed to rest-frame B following the procedures outlined in the cited papers. Note that this trans- 
formation is different for each galaxy, as it depends on the redshift. Effective radii were converted to kpc. We determined masses 
and offsets in M/Lg ratio in the same way as for individual cluster galaxies (see § 5.1). Eleven of 18 galaxies in the MS 1054-03 
field, and four of ten galaxies in the HDF-N, have masses > 10" Mq. 

vanderWel et al. (2005) 

van der Wei et al. (2005) study a sample of 38 field early-type galaxies in two fields, the Chandra Deep Field South (CDF- 
S) and the field containing the cluster RDCS 1252-29 (z = 1.24). The galaxies were selected by a combination of color- and 
morphological criteria. They span the redshift range 0.62 < z < 1.14, with median 0.97. van der Wei et al. follow the same 
procedures as we do here for determining structural parameters and velocity dispersions. The listed surface brightnesses (Table 
2 in van der Wei et al. 2005) were transformed to the rest-frame B band, and effective radii were converted to kpc. Twenty-five 
galaxies in this sample have M > 10" M©. 

Treu et al. (2005a,b) 

Treu et al. study a very large sample of 226 visually classified bulge-dominated galaxies in the GOODS HDF-N region. 
Although Treu et al. use different software and procedures than we do here, a direct comparison between their data and those of 
van Dokkum & Ellis (2003) (which are on our system) shows that any systematic differences are very small. Specifically, for 
a subset of the galaxies velocity dispersions were determined using the same software as used in the present study (see § 2.5), 
and the two sets of measurements agree to < 1 % on average (see Treu et al. 2005b). Surface brightnesses were converted to 
rest-frame B following the procedure outlined in Treu et al. (2005b). Next, they were converted from {ii)e to /if, and from the 
AB system to the Vega system. Effective radii were converted to kpc. Treu et al. correct the measured velocity dispersions for 
aperture effects by applying a fixed correction of dcon = LlOcap- For consistency we correct the measured values to the 
equivalent of a 3. "4 diameter aperture at the distance of the Coma cluster (see § 2.5). 

Masses and offsets in M /L ratio were calculated following the procedures of § 5.1. Of 141 galaxies classified as early-type 25 
were excluded because they could be misclassified spirals, as judged from the residuals from an r'/"* fit (see § 2.1 in Treu et al. 
2005b). Of the remaining 116 galaxies, 48 haveM > 10" M©. These galaxies are at 0.32 < z < 1.14, with median 0.84. 

Comparison of Individual Samples 

The total number of field early-type galaxies with M > 10" Mq is 88. One galaxy was excluded from this sample: object 1 
from van Dokkum et al. (2001). This galaxy is at z = 0. 183, and as the next lowest redshift is z = 0.321 including this object would 
"artificially" create a very wide bin at low redshift in Fig. 12."^ In § 6.2 we treat the remaining 87 galaxies as a single sample; here 
we test whether this approach is warranted, by examining whether there is evidence for systematic differences between the four 
samples. Data points for individual galaxies are shown in Fig.[TT] We determined a mean offset for each sample in the following 
way. We fitted a linear function to the binned field data listed in Table 3. This function has the form AlogM/L^ = 0. 045-0. 629z. 
After subtracting this linear function from the individual data points, the average of the residuals was determined for each of 
the four literature samples using the biweight estimator. We find the following average offsets in logM/Ls'. -0.01 ±0.01 (van 
Dokkum et al. 2001); -0.01 ± 0.04 (van Dokkum & ElHs 2003); +0.02 ± 0.04 (van der Wei et al. 2005); and -0.01 ± 0.02 (Treu 
et al. 2005b). These values are consistent with each other, and we conclude that it is appropriate to treat the four samples as a 
single, large sample. 



We note that this object follows the trend defined by the other 87 galaxies. 
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Fig. 11. — Individual field early-type galaxies with M > lO" M© in the four samples that are combined in this study. Filled triangles are 
galaxies from van Dokkum et al. (2001); flUed squares are from van Dokkum & Ellis (2003); filled circles are from van der Wei et al. (2005); 
and open circles are from Treu et al. (2005b). 



TABLE 2 

Galaxy Sample 



ID" 


Type" 


z 


a (km/s)* 


± 


logr,(")^ 




V555-/8 


3C 295-47 


E/SO 


0.1308 












3C 295-568 


SO/Sb 


0.4545 


152 


10 


-0.092 


23.68 




3C 295-834 


E 


0.4651 


165 


14 


-0.801 


21.35 




3C 295-868 


SO 


0.4565 


197 


18 


-0.439 


22.93 




3C 295-968 


E 


0.4601 


213 


12 


-0.176 


23.64 




3C 295-2014 


E/SO 


0.4616 


284 


22 


0.583 


25.33 




CL 0016^38 


E 


0.5399 


229 


14 


-0.269 


22.96 


2.460 


CL 00 16-461 


E 


0.5458 


268 


24 


-0.532 


22.37 


2.461 


CL 0016-531 


E 


0.5420 


209 


21 


-0.687 


21.98 


2.443 


CL 0016-611 


E 


0.5509 


169 


18 


-0.441 


22.71 


2.466 


CL 0016-612 


E 


0.5508 


279 


39 


-0.322 


22.75 


2.574 


CL 0016-650 


E/SO 


0.5445 


172 


20 


0.633 


26.35 


2.482 


CL 0016-659 


E 


0.5502 


264 


27 


-0.293 


23.13 


2.441 


CL 0016-724 


E 


0.5463 


252 


17 


0.462 


25.46 


2.582 


CL 0016-725 


E 


0.5389 


226 


18 


0.739 


26.18 


2.531 


CL 0016-745 


E 


0.5478 


124 


14 


-0.105 


24.06 


2.519 


CL 0016-2050 


E 


0.5536 


125 


18 


-0.444 


23.11 


2.366 


CL 1601-270 


SO/E 


0.5098 


186 


12 


-0.315 


22.84 




CL 1601-292 


E 


0.5425 


179 


12 


-0.301 


22.99 




CL 1601^74 


E/SO 


0.5376 


313 


51 


0.033 


24.30 




CL 1601-524 


E 


0.5403 


226 


16 


-0.199 


23.50 




CL 1601-619 


E 


0.5392 


232 


12 


-0.044 


24.00 




CL 1601-753 


E 


0.5395 


281 


12 


0.131 


24.20 




CL 1601-814 


E 


0.5409 


226 


19 


-0.452 


22.88 




CL 1601-2040 


E 


0.5439 


256 


10 


0.062 


23.83 




CL 1601-2043 


E 


0.5407 


163 


14 


-0.461 


23.13 




CL 1601-2060 


E 


0.5430 


235 


36 


-0.165 


23.43 





Remarks 



Field galaxy" 
Spiral galaxy" 



— AGN(3C295) 



Field early-type" 
Merger" 



"From Small etaL (1997) 

* Velocity dispersion corrected to a 3 "4 diameter aperture at the distance of the Coma cluster. 
^ Circularized effective radius. 

'' Surface brightness at the effective radius, in rest-frame B, corrected for Galactic extinction but not corrected for cosmological surface britgh- 
ness dimming. 
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' Not included in the analysis of the MjL evolution. 



